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GENE DELIVERY FUSION PROTEINS 

Technical FieM 

5 Tbt invention relates to the field of gene delivery, more specifically to proteins 

useful for introducing polynucleotides into target cells. Still more specificaUy, the 
invention relates to fusion proteins that are capable of botfa bindiiig to a polynucleotide 
of interest, and of facilitatir^ delivery of the bound polynucleotide to a target ceU, 
especially to a mammalian target cell. 

10 

Background 

Mai^ viruses have been adapted for use as gene delivery vectors for mammalim 
ceils. Viruses have highly efficient mechanisms for entering cells, and in some cases 
also have specific mechanisms for integrating the viral gemmie into the host ceU 

t. 

15 chromosome. The high efficiency of gene transduction afforded by the viral vectors is 
the principal advantage of using a virus-based system for gene delivery. In addition, 
the &a diat die viruses are particulate allows virus-based systems to be considered for 
in vivo gene delivery. These attributes have led to the wide use of viral vectors in 
gene transfer studies. >^nises diat have been used for this purpose inchide 

20 retroviruses, adenoviruses, parvoviruses, papovaviruses, poxviruses and herpesviruses. 
More recently, llie utilily of vhral vectors has led to the use of retroviruses and 
adenoviruses in gene tberqqr applications. 

Although die virus-based delivery systems can give rise to high efficient <tf . . 
genedelivety,diey suffer firom a number of disadvantages. For example, the most 

23 widely used viral system, die retroviral vectors, have been extenavely modified to 

prevent the generation of xeplication-competrat retrovirus (RCR), but since such RCR 
. has the potential to be leukemogenic (see Donahue et al., J. Exp. Med. 176:1 125-1 13S, 
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1992), all retroviral prq)arations for use in gene therapy must undergo extensive 
validation testing to confiim the absence of RCR before use. In addition to these safety 
concerns, retroviral and other viral vectors can place size and sequence constraints on 
the genetic material that can be transferred and on the target cells that can be infected 
5 (see» e.g., Israel & Kaufman. Blood. 75:1074-1080. 1990; Shimotohno & Temin. 

Nature 299.265-268. 1982; Stead et al.. Blood. 71:742-747. 1988; and Bodine et al.. 
Blood. 82:1975-1980. 1993). 

The development of efficieitt non-viral gene delivery (NVGD) systems would 
allow gene transfer/gene therapy studies to be performed in the absence of the 
10 aforementioned limitations of the viral vectors, and could also have the advantages of 
ease of scalability, cost and speed of generatioiL Based on these advantages, non-vinl 
gene delivery systems could also allow more diseases to be treated through gene 
therapy by making iiyectable gene delivery systtms a reality. 

Existing non-viral gene delivery systems can be nyqgihly divided into pl^sieil 
15 and biochemical approaches. The pl^sical methods inchute such trrhnifiM i at 
electrpporation, particle bombardment, scntpe loading and calchm phniitmti 
translation (see. e-g., Fechbeimer et al.. P.N.A.S. 84:8463-8467, 1987; Cheng et al., 
P.N.A.S. 90:4455-4459, 1993; and Kiiegler, M. (ed.). "Gene Transfer and 
Ejqnession, a IjSaontory Manual,* 1990, WJI. Freeman Publishen). The 
20 biocheinical meAods invoNe miung the DNA to be delivered with teage^ 

DEAE-dextran, gramicidin S, Iqwsomes, polyamidoamine polymen, polyamines, 
polybrene, cationic proteins and poly-L-lysine-based conjugates (see, e.g., Kawai &. 
Nishizawa. Mol. Cell. Kol. 4:1172-1174. 1984; Behretal.. P.N.A.S. 86:6982-6986. 
1989; Rose et al., P.N.A.S. Biotedmiques 10:520-525, 1991; Pardridge & Boado, 
25 F.RB.S. Lett. 288:30-32, 1991; Legendre & Sroka. P.N.A.S. 90:893-897, 1993; 

Haensler &. Szoka, Bioconj. Chcm. 4:372-379, 1993; and Wu and Wu, J. Biol. Chem. 
262:4429^32. 1987). 

These different qqnoaches vary in their efficiency of gene deliveiy and in their 
ability to confer long-term (i.e. stable) retention.of transferred «y<pitnctii Howe««r, 
30 the biochemical ^roadies are in general more attractive horn a gene therqiy point of 
view because such qiptoaches have a greater potential for use witbm ii^ectable gene 
deliveiy ^sterns dian do most of die pitysical ap pro a c h es. 
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One problem with the use of conjugates based on poly-L-lysine or otter basic 
polymers, which are assembled via chemical cross-linking, is that the ctemical steps 
required for cross-lmking can be both imprecise and cumbersome. Moreover, it can te 
very difficult to control the stoichiometiy of tte different conjugate components in such 
5 a system, panicularly as more components are added to facilitate gene delivery. 

Summary of the Invention 

In view of the continuing and unmet need for safe, efficient and stable non-viral 
gene delivery systems, tte present invention provides a generalized approach for tte 

10 * modular construction of fusion proteins that are capable of both binding to a 

polynucleotMle of interest, and of facilitatiqg delivery of tte bound polynucleotide to a 
target cell, especially to a human target cell for gene Aeapy. 

Tte proteins of tte present invention, termed Gene Delivery Fusion Proteins 
(GDFPs) conqnise a nucleic acid binding domain (NBD) that contains a con^onent 

15 capable of binding tte targeted micleic acid; fused to a gene delivery domaia (ODD) 
that contains one or more components that mediate or ftcilitate delivery of tte targeted 
nucleic acid to dse target cell. 

As described in detail telow, nucleic acid binding domains can comprise any of 
a number of conqionenls, tiie essential feature of wbkh is that tbey are capable of 

20 binding nucleic acids. A number of such conqxments are known in tte art (see, e.g., 
tte references cited telow), inchiding proteins that bind nucleic ackis in a sequence- 
specific mam»: and proteins ttet bind nucleic acids relatively non-specifically. For 
purposes of discussion and illustration, nucleic acid binding domains can te 
conveniently grouped into either of two bask subsets depending on whether tte nucleic 

25 acid binding donuun does or does not contain an analog of a sequeiKe-speciflc nudetc 
acid binding protein, as described in more dttaSl below. . 

in a first type of gene deUvety fusion protein of tte present mventkm 
(sometimes referred to herein as a *Type-I GDFP"), die nucleic acid binding domain 
contains an analog of a sequence-q^ecific nudeic acid blndiiig protein (sequence- 

30 specific NBP). In a secmd type of gene delivery fusion protein of tiie present 

invention (sometimes referred to herein as a *T^pe-n GDFP"), the nucleic acid binding 
doitnain coiitaiiis an analog of a seq^ieitte-non-specific nucldc add b^^ 
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(sequence-non-specific NBP) and does not contain an analog of a sequence-specific 
NBP. 

Thus, one embodiment of a GDFP of the present invention is a macromolecule 
useful in delivering a urgeted nucleic acid to a target cell, comprising a gene delivery 
5 fusion protein (GDFP), said GDFP comprising a nucleic acid binding domain (NBD) 
that contains a component capable of binding to a cognate recognition sequence in the 
targeted nucleic acid which component is derived from a sequence-specific nucleic acid 
bindiqg protein; fused to a gene delivery domain (ODD) that contains one or more 
camponata that mediate or facilitate deliveiy of the targeted nucleic acid to the target 

10 cell. In addition to the binding conqKment derived from a sequence-q)eci& NBP, the 
mideic acid binding domain of Type-I GDFPs can also contain additional bindnv 
components, as discussed below, which can be derived from either sequeoce-specifk or 
sequrace*non-qiecif]c nucleic acid binding proteins. 

Anodier embodiment of a GDFP of the present invention is a macromolecule 

IS useful in delivering a targeted nucleic acid to a target cell, comprising a geae delivery 
fusion protein (GDFP), said GDFP comprising a nucleic acid binding domain (NBD) 
that contains a con4)onent capable of binding the targeted nucleic acid which 
component is an analog of a sequence-non-specific nucleic acid binding protein; fused 
to a gene delivery domain (GDD) that contains one or more components diat mediate 

20 or facilitate delivery of the targeted nucleic acid to the target cell. 

In one aspect of die invention, the components of the gene delivery domain 
(GDD) that facilitate delivery of die targeted nucleic acid to the target cell are selected 
ham die group consisthig of a binding/taigetiqg component, a membrane-disruptiiv 
conqxment, a transport/localization CQnq)onent and a replicon integration component 

25 inanotheraq^eclof die invention, the various functional 

die GDFP are separated flexible peptide linker sequeooes ("flexoos"), wbidt can 
enhance the ability of the components to adqit ccmfonnations relatively independently 
ofeachod«. 

Another endiodinient of a GDFP of die invention is a recombniant 
30 polynucleotide encoding a GDFP. In a pr efe rre d embodiment of diis type, 

polynucleotide is an expression vector and is arranged so diat die various donoains and 
conqx)nents of the GDFP are cxpttssed as an in-frame fusion product, dierdyy allowing 
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for efficient modular synthesis of the GDFP as a single recombinant product. Yet 
another embodiment is a method of using the above-described recombinant 
polynucleotide to produce a GDFP, said method comprising the steps of causing the 
recombinant polynucleotide to be transcribed and/or translated and recovering a GDFP. 

5 As discussed herein, the preferred method involves the modular synthesis of the GDFP 
as a single protein product. Yet another embodiment is a medKxl of using a GDFP to 
deliver a targeted nucleic acid (tNA) to a target cell, the method comprising the steps 
of contacting the GDFP with the targeted nucleic acid to produce a GDFP/nucleic acid 
complex and contacting said GDFP/nucleic acid complex with the target cell. 

10 Preferably, the tNA is an expression vector. Prefierably, the target cell is a mammalian 
cell. Yet another embodiment is a cell produced by the above-described method of 
using a GDFP and the progeny thereof. 



IS pirief Desffriptiftn ftf ftft Prawliigg 

Figure 1 is a schematic representation of an embodiment of the Gene Delivery 
Fusion Protein (GDFP) concept using a Typ^-I GDFP. 

Figures 2A and 2B are diagrams of the cloning strategy used to generate 
expression vectors encoding GAL4, and the GAL4/IL-2 and IL-2/GAL4 GDFPs. 
20 Figure 3 is an SDS-PAGE gel of 35-S labeled GAL4/IL-2m GDFP. 

Figure 4 is a gel-shift assay showing retention of DNA binding activity by the 

GAlA/ILrTm GDFP. 

Figure 5 diows retention of 11^2 bioactivitytqr the GAL4/^ « 
Figuie 6 is an SDS-PAGE gel of 3S-S labeled GAL4/n^2 GDFP and IL- 
25 2/GAMGDFP. 

Figue 7 shows sequenoe*specific DNA bindiqg of the GAL4 protein and die IL- 
VGAIA dud GA1>I/IL^2 GDFPs. 

nguxt 8 diows the cytoldne bioactivity of the n/-2/GAI4 an^ 

GDFPs. 

30 Figure 9 shows the resuta erf an assay demonstratiqg the ability of GDFPs to 

bind to ILr2 recq^tor-bearing CTLL. 
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Figure 10 shows the results of an assay demonstrating the abili^ of GAL4/IL-2 
GDFP and IL*2/GALI GDFP to mediate binding of a target oligomer to IL-2 
receptor*bearing CTLL. 

Figure 11 shows the results of an assay demonstrating the abiliQr of GAL4/IL-2 
5 GDFP to mediate binding of a target plasmid to IL-2 receptor-bearing CTLL. 

Detailed Description of the Invention 
The invention provides a non-viral gene delivery system by which DNA, RNA 
and/or analogs thereof ("targeted nucleic acid" or "tNA" to be used in gene delivery) 

10 are modified by association with a gene delivery fusion protem (GDFP). The non-viral 
gene delivery system of the present invention conqnrises a macromolecular conq>lex of 
two sqKurate entities: Ae targeted nucleic acid to be delivered* and a GDFP. The 
GDFP conqirises a nucleic acid binding domain (NBD) that can bind to the targeted 
nucleic acid and thus lead to the formatkm of a GDFP/fNA CQnq>lex: fused to a gene 

IS deliveiy domain (GDD) that can mediate or facilitate the delivety of the GDFP/tNA 
complex into die target cells. 

In a prefierred embodiment of the nivention the open reading ftames encoding 
the various GDFP domams and components are fused to enable expression of die 
GDFP as a single polypeptide. However, die GDFP may also comprise, for exanq)le, 

20 one or more shon flexible peptide linker sequence ("flexons") between the individual 
domains and/or conqK>nents. 

G^BCTBi PefiiMtiTOff 

The terms "polypqMide*, "pepticfe" and "protein* are used interchangeably to 
25 refer to polymers of amino acids and do not refer to any particular lei^gdis of the 
potymeis. These terms also indtude post-translationally modified proleinSt for 
exampte, glycosylated, ace^lated, phosphoiyUited proteins and die Alsoinchided 
widiin die defimtioii are, for exanq>le, proteins containiiig one or more analogs of an 
amino acid (inchidiitg, for example, unnatural amino acids, etc.), protehis widi 
30. mbstituted linkages, as weU as odier naodificatioiis 
occurring and non-naturally occurring. 
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''Nadve' polypeptides or polynucleotides refer to polypeptides or 
polynucleotides recovered from a source occurring in nature. Thus, the phrase "native 
viral binding proteins'* would refer to naturally occurring viral binding proteins. 
"Mutein" fonns of a protein or polypeptide are those which have minor 
5 alterations in amino acid sequence caused, for example, by site-specific mutagenesis or 
other manipulations; by errors in transcription or translation; or which are prepared 
synthetically by rational design. Minor alterations are those which result in amino acid 
sequences wherein the biological activity of the polypeptide is retained and/or wherein 
the mutein polypeptide has at least 90% homology with the native form. 

10 An "analog" of a polypeptide X inchides fragments and muteins of polypeptide 

X that retain a particuhur biologkal activity; as well as polypeptide X thai has been 
incorporated into a larger molecule (other than a molecule within which it is normally 
found); as weU as synthetic analogs that have been prepared by rational design. For 
example, an analog of a DNA binding protein mi^ refer to a portion of a native DNA 

15 bindiqg protein that retains the ability to bind to DNA, to a mutein thereof, to an entne 
native binding protein that has been incorporated into a recombinant fusion protein, or 
to an analog of a native binding protein dial has been synthetically prepared by rational 
design. 

"Polynucleotide" refers to a polymeric form of nucleotides of any length, either 
20 ribonucleotides or deoxyribonucleotides, or analogs thereof. This term refers only to 

the primary structure of the molecule. Thus, double- and single-stranded DNA, as well 
as double- and single- stranded RNA are inchided« It also inchides modified 
polynucleotides such as methylated or capped polynucleotides. 

An "analog" of DNA, RNA or a polynucleotide, refers to a macromolecule 
.25 resembUqg naturall^-occuxring polynucleotides in form and/or function (particularly in 
the ability to oitgage in seq[uenDe-speci& l^dro 

conq)lementaxy polynucleotide sequence) bui which diffeis from DNA or RNA in, for 
exampk, die possession of an unusual or non-nalur^ A 
large variety of such molecules have been described for use in antisense technology; 
30 see, e.g., E. UUmaim el al. (1990) Chemical Reviews 90:543-584, and the 
publications reviewed Oerdn. 
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An "antisense'' copy of a paiticular polynucleotide refers to complementary 
sequence that is capable of hydrogen bonding to the polynucleotide and may, therefore, 
be capable of modulating expression of the polynucleotide (i.e. by "antisense" 
regulation). Such an antisense copy may be DNA, RNA or analogs thereof, inchiding 

5 analogs having altered backbones, as described above. The polynucleotide to which the 
antisense copy binds may be in single-stranded form (such as an mRNA molecule) or in 
double-stranded form (such as a portion of a chromosome). 

A "rq)licon" refers to a polynucleotide comprismg an origin of replication 
(generally referred to as an sol sequence) which allows for replication of die 

10 po^nucleotide in an q)propriate host cell. E3unq>les inchide replicons of a target cell 
into which a desired nucleic acid migbt integnoe (in particular, nuclear and 
mitochoDdrial chromosomes; and also extnudmmwsomal replicons such as plasmids). 

"Recombmant,* as applied to a polymcleotide. noeans that the polynucleotide is 
the product of various combinatimis of cloning. lestricttep and/or ligation 8tq» 

15 resulting in a eonstnict that is distinct from a polynucleotide found in natim. 

"Reccmbinant' may also be used to to the protein product of a reconibinant 
polynucleotide. Typically, DNA sequences encoding the structural codfaig sequence 
for. e.g.. components of the NBD and GDD, can be assenibled from cDNA firagnuBnts 
and short oUgonucleotide linkers, or from a series of oligonucleotides, to provide a 

20 synthetic gene which is capable of being expressed when operably linked to a 

transcriptional regulatory region. Such sequences are preferably provided in the form 
of an open reading frame tmintemqrted by internal non-translated sequences (i.e. 
"introns"), such as those commonly found in eukaryotic genes. Such sequences, and 
all of die sequences refimed to in the context of the present invention, can also be 

25 generally obtained by FCR amplification using viral, prokaryotic or eukaryodc DNA or 
. RNA teii^)lales in conjunction widi tq^proptiate PGR amplim e r s. 

A 'tecombinatt expression vector" refim to a potynudeotide ^iddcfa contains a 
transcrq>tional t^ulaloiy region and coding seqoences necessary for die o^resskin of 
an RNA mokcute and/or protein and whidi is capabte of bdng introduced Ibid ai target 

30 cell (by, e.g., viral infiecdon, transfecdon. elecuupora d on or Iqr die non-viral geoe 

delivery (NVGD) techniques of die present invendmO. A furdw exanple would be an 
expression vectw used to tacpna a GDFP of die present invenlloo. 
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"Recombinant host cells", "host cells", "cells", "target cells", "cell lines", "cell 
cultures", and other such terms denote higher eukaryotic cells, most preferably 
mammalian cells, which can be. or have been, used as recipients for recombinant 
vectors or other transfer polynucleotides, and inchide the progeny of the original cell 
5 which has been transduced. It is understood that the progeny of a single cell may not 
necessarily be completely identical (in morphology or in genomic or total DNA 
complement) to the original parent cell, due to natural, accidental, or deliberate 
mutation. 

An "open reading frame" (or "ORF") is a region of a polynucleotide sequence 

10 that can encode a polypq)tide or a portion of a polypeptide (i.e., the region may 

represent a portion of a protein coding sequence-or an entire protein coding sequence). 

"Fused" or "fusion" refers to the joining together of two or more elements, 
con4X)nents, etc.. by ^i^tever means (including, for example, a "fusion protein" made 
by chemical coiqugation (whether covalent or non-covalent), as well as the use of an 

15 in-frame fusion to generate a "fusion protein" by recombinant means, as discussed 

infra). An "in^fhone fusion" refers to the joining of two or niore open reading 
frames (ORFs), by recombinant means, to form a single larger ORF. in a manner that 
maintains the correct reading frame of the original ORFs. Thus, the resulting 
recombinant fusion protein is a single protein containing two or more segments that 

20 correspond to polypeptides encoded by die original ORFs (which segments are not 

normally so joined in nature). Although die reading frame is dius made continuous ^ 
throughout the fused segments, the segments may be physically separated by, for 
exanq)te. in-frame flexible polypeptide linker sequences ("flexons"), as described infra. . . 
A "flexon* refiers to a flexible polypeptide linker sequence (or to a nucleic acid 

25 sequence encoding such a polypeptide) which Qnpically conqirises aminp acids havii^ 
small side diains (e,g., glycine, alanine, valine, leocine, isoleucine and serine). In the 
present invention, flexons can be incorporated in the GDFP between one or more of the 
various domains and components. Incorporating flexons between these components is 
believed to pramote fiinctionaliQf by allowing them to adopt confionnatipns relatively . 

30 independjBnfly from each other. Most of the amino acids incoiponted into the flexon 
win preferably be amino acids halving smaU ade chains. The flexon will prefierBbly 
conoqinrise between about four and one hundred amino acids, more prefeind)^ behvMi 
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about eight and fifty amino acids, and most preferably between about ten and thir^ 

amino acids. Flexon ("Pixy") sequences described in U.S. Patents 5,073,627 and 

5,108,910 will also be suitable for use as flexons. 

A "transcriptional regulatory region" or "transcriptional control region" refers to 
5 a polynucleotide encompassing all of the cis-acting sequences necessary for 

transcription, and may include sequences necessary for regulation. Thus, a 

transcriptional regulatory region includes at least a promoter sequence, and may also 

include other regulatory sequences such as enhancers, transcription factor binding sites, 

polyadenylation signals arKi splicing signals. 
10 "Operably linked" refers to a juxtaposition wherein the components so described 

are in a relationship permitting them to function in their intended maimer. For 

instance, a prcmioter sequence is opmbly linked to a codiiig sequence if the promocer 

sequence promotes transcription of the codir^ sequence. 

"Transduction," as used herein, refers to the introduction of an exogenous 
15 polynucleoti(te into a host cell, irrespective of the mediod used for the insertion, whidi 

methods include, for exanqple, transfectkm* vkal infection, transformation, 

ele c tr opor ation and the non-viral gene delivery technique of die present invention. 

The introduced polynucleotide may be stably or transiently maintained in the host cell. 

Stable maintenance typically requires that ibt introduced polynucleotide either contains 
20 an origin of replication compatible widi the host cell or integrates into a replicon of the 

host cell such as an extrachromosomal replicon (e.g. a plasmid) or a nuclear or 

mitochondrial chromosome. 

"Retroviruses" are a class of vuruses which use RNA-directed DNA polymerase, 

or reverse transcriptase, to replicate a viral RNA genome resulting in a double-stranded 
25 DNA intermediate which is incorporated mto chromosomal DNA of an avian or 

mammalian host cell. Many such retrovinises are known to those skilled in tte 

afg dggcrihed far gxample- in Wcm ef rf„ isd«, RN^ T^«m YiniiHff ■ ^ , 

Spring Harbor, New York (1984 and 1985). Plasmids containqg retroviral genomes 
are also widely available, from die American Type Culture CoUectiofi (ATCCi and 
30 odier sources. The nucleic add sequences off a large number of these viruses are 

known and are generally available, fior example, from databases such as GENBANK. 
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A **sequence-specific nucleic acid binding protein" is a protein that binds to 
nucleic acids in a sequence-specific manner, i.e., a protein that binds to certain nucleic 
acid sequences (i.e. "cognate recognition sequences", infra) with greater affinity than to 
other nucleic acid sequences. A "sequence-non-specific nucleic acid binding protein" is 
5 a protein that binds to nucleic acids in a sequence-non-specific manner, i.e. a protein 
that binds generally to nucleic acids. 

A "cognate" receptor of a given ligand refers to the receptor normally capable 
of binding such a ligand. A "cognate" recognition sequence is defined as a nucleotide 
sequence to which a nucleic acid binding domain of a sequence-specific nucleic acid 
10 binding protein binds with greater affinity than to other nucleic acid sequences. A 
"cognate*" interaction refers to an intermolecular association based on such types of 
binding (e.g. an association between a receptor and its cognate ligand, and an 
association between a sequence-specific nucleic acid binding protein and its cognate 
nucleic acid sequence). 

IS "Gene delivery" is defined as the introduction of targeted nucleic acid into a 

target cell for gene transfer and may encompass targeting/binding, uptake, 
transport/localization, replicon integration and expression. 

"Lymphocytes" as used herein, axe spherical cells with a large round nucleus 
(which may be indented) and scanty cytoplasm. They are cells diat specifically 

20 recognize and respond to non-self antigens, and are responsible for develq>ment of 
specific immunity. Inchided widun "lynq)hocytes" are B-lymphocytes and T- 
lymphocytes of various classes. 

"Lymphohematppoietic stem cells" are cells ^i^iiich are typically obtained from 
the bone manow or pe rip heral blood and which are capable of giving rise, through cell 

25 division, to any mature cells of ibt lymphoid or hematopoietic systems.. Hiis tenn 
includes committed progenitor cells with significant thcmgh limited cqiacity for self« 
renewal, as well as die more primidve cells mch as those capable of forming spleen 
colonies in a CFU-S assqr, and still more primitive cells possessing lopg-teim and/or 
multilineage re-populating ability in a transplanted wiafntnttiiftti Iml 

30 . "Lyn^hobematopoietic ceUs" include die various mature celb of th^ 

or hematopoietic systems (inchiding lymphocytes and odier blood cells), as well as 
lyn^hohematqiofetic stem ceOs. 
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A "primary culture of cells" or "primaiy cells" refer to cells which have been 
derived directly from in vivo tissue and not extensively passaged. Primaiy culnires can 
be distinguished from cell lines and established cultures principally by the retention of a 
kaiyotype which is substantially identical to the kaiyotype found in the tissue from 
which the culture was derived, and by the celhilar responses to manipulations of the 
environment which are substantially similar to the a m& cellular responses. 

As is described in detail below, the non-viral gene delivery complexes of the 
present invention comprise gene delivery fusion proteins (GDFPs) that bind targeted 
nucleic acid through a nucleic acid binding domain (NBD) and facilitate gene delivery 
through a gene delivery domain (GDD). Each of these domains can comprise a 
number of different functional components and sub-components. Some of these 
potential components are sununarized in the following list: 

NON-VmAL GENE DELIVF ^Y T^MPI l^C fthg ^GDFP/tNA Compter 

1, Gmg DdWcrv Rision Protdn (GDgP> 

A. Nucleic Acid Bnidiqg Domain (NBD) 

(1) Nucleic Acid Binding (NB) cooqxment 

Q) Other possible components (e.g. mediatiqg compression of tNA) 

B. Gene Deliver Domain (GDD) 

(1) Binding/Targeting (B/T) component 

(2) Membrane-Disrupting (M-D) conqxmeot 

(3) Transport/Localization (T/L) component 

(4) Replicon Integration (RI) compoiKnt 

2, Targgted Nu rfric Add ftNA) 

A. Binding sites for the GDFP (see infra) 

B. Sequence of interest (e.g. gene ID be delivered) 

C. Oil» possibte sequences (e.g. selectabte mai^ 

Pyh of Aese domains and components, as weU as additional elements diat may 
be inchided* are defined and described m detul bdow. 
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The practice of the present invention will employ^ unless otherwise indicated, a 
number of conventional techniques of molecular biology, microbiology, recombinant 
DNA, and immunology, which are within the skill of the art. Such techniques are 
explained fully in the literamre, see, e.g., Kriegler, M. (ed.), "Gene Transfer and 

5 Expression, a Laboratory Manual," (1990), W.H. Freeman Publishen; Sambrook, 
Fritsch, and Maniatis, "Molecular Cloning: A Laboratory Manual," Second Edition 
(1989); F.M. Ausubel et al. (eds.), "Current Protocols in Molecular Biology," (1987 
and 1993); MJ. Gait (ed.). ''Oligonucleotide Synttesis," (1984); RJ. Freshney (ed.), 
"Animal Cell Culture," (1987); J.M. Miller and M.P. Calos (eds.), "Gene Transfer 

10 Vectors for Mammalian Cells," (1987); D.M. Weir and C.C. Blackwell (eds.). 

"Handbook of E}q)erimental Immunology;" J-E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach and W. Strober, (eds.)» "Current Protocols m Immunology," 
(1991); and the series entitled "Methods in Enzynx>logy," (Academic Press, Inc.). All 
patents, patent q>plications, and publications mentioned herein, both siqm and infia, 

IS are hereby incorporated herein by reference. 

mustrationg of Tvpe-I Gene DeMverv Fteion Protdm 

The Gene Deliverv Fusion Protein / Targeted nucleic acid Complex f GDFP/tNA> 
20 One concept of the present invention is to create recombinant gene delivery 

fusion proteins (GDFPs) that are able to bind to a cognate recognition sequence in a 
targeted nucleic acid (tNA) and facilitate delivery of Oe tNA into a target cell. The 
GDFPs bind targeted nucleic acid through a nucleic acid binding domain (NBD) and 
facilitate gene delivery through a gene delivery donuun (GDD). 
25 Thus, in the context of the present invention, targeted nucleic acids can be 

delivered via one or more steps that are mediated or augmented by GDFPs. In 
particular, die gene delivery process can hshide one or more of the foUowipg steps: 

(1) buiding and/or targedpg of the GDFP/tNA con^lex to the surface of a target cell; 

(2) uptake of Oe tNA (widi or wiflxnit the GDFF) 1^ the target cell; 0) intracettular 
30 transport and/or localization of the tNA to an oisanelle such as a nucleus or 

mitochondrion; and (4) integration of the tNA into a ceUular replicon such a 
chromosome. A particular GDFP iieed nm iKcessarity perform aU of dxse fimctiw^ 
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For example, a GDFP intended to deliver an expression vector to the nucleus of a cell 
could be constructed to contain: (i) an NBD capable of binding to a cognate recognition 
sequence on the expression vector and; (ii) a GDD having only a transport/localization 
component such as a nuclear localization sequence. Such a GDFP could then be 
5 complexed with targeted nucleic acid and introduced into target cells by a transduction 
method such as electroporation. The GDFP would then £icilitate transport/localization 
to the nucleus, perhaps to a specific site in a replicon, and thus enhance expression of 
tte vector. Alternatively, for example, the aforementioned GDD could be modified to 
include a binding/targeting conq)onent and a membrane^disnipting component. Using 

10 such a GDD, the GDFP/tNA complex could be directed to a particular cell type within 
a population of cells, and uptake of the conq)lex could proceed without the need for, 
e.g., electroporation. Use of the GDFPs in conjunction with techniques such as 
elecinqxmtion, as in the former exanq>Ie, would of course be more appropriate for in 
vitro gene delivery. Use of GDFPs as described in the latter example could be readily 

15 i^lied to die delivery of genes either in vitro or in vivo. Snnilaity, the GDFP/tNA 
complexes could be used as admixtures with odier proteins or mnplt chemicals diat 
enhance gene ddiveiy. This could include, for exaniple, efihaneing the uptake of 
GDFP/tNA complexes by adding membrane disrupting agents in Uas* 

Other combinations of components can be prepared (and particular versions of 

20 fbc campoatm can be selected) according to the specific design objectives of the gene 
delivery scheme. These obfectives include, for exanq>]e, the location of the cells to be 
targeted, die desired cellular specificity of targeting, and the desired sub-cellular 
destinatira of die tNA- 

The individual domains and conqionents of die GDFP/tNA complex and their 

25 construcdon and assembly are described in more detail below. 

L The Gene Dcliverv Fusion Protein (GDFPl 

The GDFP comprises two major domains, a nucleic acid binding domain (NBD) 
and a gene delivery domain (GDD). Each of these major domains conqirises one or 
30 . more components fecilitating nucleic acid binding and gene delivery, respecdvdy. 

These individual conqxments may be derived from naturally*K)cciirring proteins, or they 
inay be syndietic (e.g. an analog of a naturally-oocuning conqponent). Typicalty, 
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cloned DNA encoding various components will already be available as plasmids - 
although it is also possible to synthesize polynucleotides encoding the components 
based upon published sequence information. Polynucleotides encoding the components 
can also be readily obtained using polymerase chain reaction (PCR) methodology, as 
5 described, for example, by MuUis and Faloona (1987) Meth. Enzymology 155:335. 
In the construction of the GDFP, discussed in more detail below, DNA 
sequences encoding the domains and their various components are preferably fused 
in-frame so that the GDFP can be conveniently synthesized as a sii^le polypeptide 
chain (i.e. not requiring further assembly). The various domanis and components can 
10 also be separated by flexible peptide linker sequences caUed "flexons" which aie 
defined in more detail above. 

A. The Nucleic Acid Binding Domain fNBD> 

A nucleic acid binding domain is a length of polypeptide capable of binding 

15 (either directly or indirectly) to die targeted nucleic acid (tNA) with an affmity 

adequate to allow the gene delivery domain of the GDFP to mediate or augment the 
delivery of tte tNA mto a target ceU. Most conveniently, the NBD wiU bind directly 
to the tNA without the need for any intermediary bindiqg element. 

In Type-I GDFPSt the NBD contains a sequence-specifk bindiqg conqxinent that 

20 is an analog of a sequence-specific nucleic acid bindmg protein. In one prefened 

embodiment of this type; the co mpon en t allows the nucleic acid bindiqg by the NBD to 
be sequenoe-spedfic widi respect to the tNA« in which case Ae NBD may bind to a 
specific cognate recognition sequence whfain the fNA; as is illustrated in Figure L 
As described herein* one particular advantage of the Type I GDFP approach is 

25. Aat it not only allows die stoichiometric attachment of delivery conqxments to d^ tNA, 
but also allows the GDFP to be positioned at pre-determtned locations with reqiect to 
die tNA. For exanqile, the posidoniiQ of NBD cognate recognition sequeiices in 
proximity to terminal integrase recognition sequences can facilitate the use of GDFPft to 
mediate integration* as described below, 

30 The NBD may comprise, for exanq)le, a known nucleic acid binding protein, or 

a nucleic acid binding reg|ion thereof. The NBD may also conqmse two or more 
nucleic add binding regions derived from die same or different nucleic acid biiKliiig 
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proteins. Such multimerization of nucleic acid binding regions in the NBD can allow 
for the interaction of the GDFP with the targeted nucleic acid to be of desirable 
specificity and/or higher affinity. This strategy can be used alone or in combination 
with multimerization of recognition sequence motift in the tNA to increase binding 
5 avidity, as discussed below. 

DNA encoding the NBD domain of the GDFP may be obtained from many 
different sources. For example, many proteins that are capable of binding nucleic acid 
have been molecularly cloned and their cognate target recognition sequences have been 
identified (see, e.g., MitcheU & Tjian, Science 245:371-378, 1989; Pabo & Sauer, 

10 Ann. Rev. Biochem. 61:1053-1095, 1992; Harrison, S.C., Nature 353:715-719, 1991; 
Johnson & McKnight, Ann. Rev. Biochem. 58:799-839, 1989; and references reviewed 
therein, hereby mcorporated by reference). Such sequence-specific bindiqg proteins 
include, for exanq)le, regulafoiy proteins such as those involved in transcr4)tion or 
nucleic acid replication, and typically have a modular construction, consisting of 

15 distiiKt DNA binding domains and regulatory domains (see,, e.g., Struhl, Cell 49:295- 
297, 1987; Frankel and Khn, Cell 65:717-719, 1991; and Pftbo & Sauer, Am. Rev. 
Biochem. 61:1053-1095, 1992; and references reviewed therein, hereby incorporated 
by reference). A number of families of such nucleic acid binding proteins have been . 
characterized on the basis of recurring strucoiral motife including, for exanq)le, 

20 Helix-Tum-Helix proteins such as the bacteriophage lambda cl repressor, 

Homeodomain proteins such as the Drosophila Antennapedia regulator; the POU 
domain present in proteins such as die mammalian traiiscrq>tion factor Oct2; Zhjc 
fingn proteins (e.g. GAL4); steroid recq)tors; leucine zipper proteins (e.g. (3CN4, 
C/EBP and c-jun); beta-sheet motift (e.g. the prokaryotic Arc repressor); and other 
. 25 families (inchiding serum response fiicmr, oncogenes such as c-myb, NFkB and rel, 
and others); see, e.g., Pabo & Saner, Ann. Rev. Biodnn. 61:1053-1095, 1992, and 
references reviewed therein, hereby, incorporated by reference. 

For many of these proteins, the nucleic acid buidiiv domains have been mapped 
in detail; and, for a nuinber of such donsains, recombinaitt fus^ 

30 sequences have been made and shown to retain die binding activities of the parental 
DNA binding donudn. For example, in die case of the yeast-derived transcrqidonal 
actxvaior GAL4, the DNA bindii^ domain has been defined, and fiisions of diis domam 
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to heterologous adjoining sequences have been made that retain DNA sequence-specific 
binding activity (Keegan et al., Science 231:669-704, 1986; Ma & Ptashne, Cell 
48:847-8S3, 1987). This ability to functionally 'swap' binding domains has also been 
shown for a number of other DNA binding proteins, including, for example, the E, coli 
5 lex A repressor (Brent and Ptashne, Cell 43:729-736, 1985), the yeast transcriptional 
activator GCN4 (Hope and Struhl, Cell 46:885-894, 1986), the bacteriophage lambda 
cl repressor (Hu et al.. Science 250:1400-1403, 1990), the mammalian transcription 
factors Spl (Kadonaga et al.. Cell 51:1079-1090, 1987) and C/EBP (Agre et al.. 
Science 246:922-926, 1989). Similariy* functional swapping has been rqwrted in the 

10 nuclear DNA-binding steroid hormone receptors (see, e.g.. Green and Chambon, 
Nature 325:75*78, 1987). See also, e.g., Khig Jk Rhodes, Trends Biocfaem. Sci. 
12:464-471, 1987; Berg. CeU 57:1065-1068, 1989; Wasylyk et al., Eur. J. Biochem. 
211:7-18, 1993; Faisst & Meyer, Nucl. Acids Res. 20:3-26, 1992; Struhl, Trends 
Biochem. Sci. 14:137-140, 1989; and Nelson & Sauer, CeU 42:549-558, 1985. 

15 Sequence-specific nucleic acid buiding proteins cari exhibit a rar^e of bbiding affinities 
to different cognate nucleic acid sequences in vitro (see, e.g., Vashee et al., J. 
Biol.Chem 268:24699-24706, 1993). 

Virally encoded nucleic acid binding proteins can also be used in tte present 
invention. These include, for example, die adenovirus E2A geoc product, which can 

20 bind single-stranded DNA, double-stranded DNA and also RNA (Cleghon et al.. 

Virology 197:564-575, 1993, and references cited therein); the retroviral IN proteins 
(Krogstad & Chanqxnix, J. ^rol 64: 2796, 1990); the AAV rq) 68 and 78 proteins 
(Owens et al., J. Virol 67: 997, 1993); and the SV40 T antigen (Arthur et al., J. 
ViroL, 62:1999-2006, 1988). The celhilar p53 gene product, which binds T antigen, is 

25 also a DNA binding protein (Funk tt al., Mol. CeU BioL, 12:2866-2871, 1992). 

Sinoilaily, RNA bindiqg proteins have been identified and their inclusion in the 
NBD would associate the GDFP widi a targeted RNA and thereby achie^ 
delivery mediated by the gene delivery domain of die GDFP. RNA brndir^ protems 
that can be used in die coiiiatt of the present invoation iiciude, to 

30 and Rev proteins of HIV; see, e.g., Tdey et al., P.N.A.S. 89:758-762, 1992; and 

CuUen et al., CeU 73:417-420, 1993. Shnilaily, ceUular RNA bmdtr« proteins, such 
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as the interferon-inducible 9-27 gene product (Constantoulakis et a!.. Science 259:1314- 
1318, 1993). can also be used. 

Nucleic acid binding domains of Type-I GDFPs can also contain (in addition to 
a component derived from a sequence-specific nucleic acid binding protein) one or 
5 more components that are derived from sequence-non-specific nucleic acid binding 
proteins. Such sequence-non-specific binding proteins inchide, for example, histones 
(von Holt, Bioassays 3:120-124, 1986; Rhodes, Nucleic Acids Res. 6:1805-1816, 
1979; Rodriguez et a!., Biophys. Chem. 39:145-152. 1991); proteins such as nucleolin 
(Eranl et al., Eur. J. Biochem. 191:19-26, 1990); polybasic polypeptide sequences such 

10 • as pcly-L-lysine (U et al.. Biochemistry. 12:1763-1772 1973; Weiskopf and Li. 

Biopolymers 16:669-684, 1977), avidin (Paidridge & Boado, F.E.B.S. Lett. 288:30-32 
1991); the non-histone high mobility group proteins and other proteins (see, e.g., Pabo 
& Saucr, Ann. Rev. Biochem. 61:1053-1095, 1992. and references reviewed therelq): 
that interact non-specifically with nucleic adds. Other proteins binding nucleic add in 

15 a sequence-non-specific fashion inchide retroviral nudeocapsid (NQ protefais (see. 
e.g.. Gdfand et al., J. Biol. Cbem.. 268:18450-18456, 1993). 

B. The Gene Ddiverv nomnin /r,T)p) 

The ODD portion of the GDFP contains one or more polypeptide regiou that 
» mediate or augment the efiicienpy of gene delivecy. Such sequences may inchide. for 
exanqpk, binding/targeting components, membrane-disiuitting coaqKmeitt. 
transport/localizttion con^onems. and rq>licon uu^ration con^onents, as discussed 
bekfw. 

A particular ODD need not contahi a component rq>resenting each of the 
!5 aforementioned types. Cbnveisely. a GDD may contain more than a single conqwneitt 
<tf a given type to obtain the desired activity. Moreover, a particular segment <rf a 
GDD might serve the function of two or more of these components. For exanple. a 
smgle region of a polypeptide might function boA hi bindmig to a ceO sutfiKc and in 
disnqxion of the membrane at that surface. 



30 
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m Binding/Targeting (B/T) Components 

Binding/targeting components are regions of polypeptides that mediate binding 
to cellular surfaces (which binding may be specific or non-specific, direct or indirect). 
Any protein that can bind to the surface of the desired target cell can be employed as a 
5 source of B/T components. Such proteins include, for example, ligands such as 

cytokines that bind to particular cell surface receptors, antibodies, lectins, viral binding 
proteins, cellular adhesion molecules, and any other proteins that associate with cellular 
surfaces. The "receptors" for these binding proteins include but are not limited to 
proteins. Moreover, the receptors may, but need not, be specific and/or restricted to 

10 certain cell types. Essentially, the B/T components can be prepared from any ligand 
that binds to a cell surface molecule. 

By way of illustration, one group of proteins from which the B/T components 
can be derived are cytokines. Cytokines are intercellular signallii^ molecules, the best 
known of which are involved in the regulation of mammalian somatic cells. Several 

15 families of cytokines, both growth promoting and growth inhibitory in their effects, 
have been characterized. Thus, a B/T component can conqvrise an amino acid 
sequence contaming at least that portion of a ^tokine polypeptide that is required for 
bindmg to recepton for the cytokine on the surface of mammalian cells, or a mutein of 
such a portion of a cytokine polypquide. A B/T component derived from a cytokine 

20 can, but need not, also contain the portion of the cytokine tfiat is involved m "cytoldne 
effector activi^," as described bekm. v 

Examples of cytokines that can be used in the present invention mchide, for 
exanq)le, interieukins (sudi as IL-la, ILrljS, IL-2, Er3, UM, IL-5, IL-6, IL-7, IL-9 
(P40), IL-10, IL-11, IL-12 and IL-13); CSF-type cytokines such as GM*CSF, G-CSF, 

25 M-CSF, UF, EPO, TNF-a and TNF-/J); interferons (mth as IFN-a, IFN-/3, IFN-^); 

cytokines of the TGF-/8 family (such as TGF-/31, TGF-^, TGF-/33, inhibin A, inhibin 
B, activin A, activin B); chonotactic factors (such as NAP-1, MCP-1, MlP-la, 
MlP-liS, NfIP-2, SIS/9, SIS5, SISc, PF-4, PBP, MGSA); growdi factors (such 

. . as EGF, TGF-a, aFGF, bFGF, KGF, PDGF-A, PDGF-B, PD-ECGF, INS, IGF-I. 

30 IGF-n, NGF-A; a-type intercriiK cytokines (such as IL^, GRO/NIGSIA, PF-4, 

PBP7CTAP//nX3, IP-10, MIP-2, KC, 9E3); and /9-type mtercrine cytokines (such as 
MCAF, ACT-2/PAT 744/G26, LD-7S/PAT 464, RANTES, G26, 1309, JE, TCA3, 
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MlP-la,^, CRG-2). A number of other cytokines are also known to those of skill' in 
the art. The sources, characteristics, targets and effector activities of these cytokines 
have been described and, for many of the cytokines, the DNA sequences encoding the 
molecules are also known; see, e.g.. Van Snick, J. et al. (1989) J. Exp. Med. 169: 

5 363-368; Paul, S.R. et al. (1990) Proc. Natl. Acad. Sci. USA 87: 7512-7516; Gately, 
M.K. et al. (1991) J. Immunol. 147: 874-882; Miniy, A., et al, (1993) Nature 362: 
248; and the reviews by Arai, K., et al. (1990) Amm. Rev. Biochem. 59:783-836; and 
Oppenheim, J.J„ et al. (1991) Annu. Rev. Immunol. 9:617-48; Waldman, T.A. (1989) 
Annu. Rev. Biochem. 58:875-911; Beutler, B., et al. (1988) Annu. Rev. Biochem. 

10 57:505-18; Taniguchi, T. (1988) Annu. Rev. Immunol. 6:439-64; Paul, W.E. et al., 
(1987) Annu. Rev. Immunol. 5:429-59; Pestka, S. et al., (1987) Annu. Rev. Biochem. 
56:727-77; Nfcola, N.A. et al. (1989) Amm. Rev. Biochem. 58:45-77; and Schrader, 
J.W. (1986) Annu. Rev. Immunol. 4:205-30; and the particular publications reviewed 
and/or cited Aerein, which are hereby mcorporated by reference in their entiieQr. 

15 Many of tbt DNA sequences encoding cytokines are also generally available from 
sequence databases such as GENBANK. Typically, ckmed DNA encodmg such 
cytokines will ahea^ be availabk as plasmids - although it is also possible to 
synthesize potynucleotides encoding the c^kines based upon the published sequence 
information. Polynucleotides enco(fing die c^tcddues can ato 

20 polymerase chain reaction (PGR) methodology, as described, for exanq>le, by MuUis 
and Faloona (1987) Medi. Enzymology 155:335. The detection, purification, and 
characterization of cytddnes, including assays fw identifying new cytokines effective 
upon a given target cell typt^ have also been described in a number of publications, 
including, e.g., Qemens, M.J. et al. (eds.) (1987) "Lynq)hokines and Interferons," 

25 IRL Press, Oxford; and DeMaeyer, E., et al. (1988) "Interferons and Other Regulatoiy 
Cytokines," John Wil^ & Sons, New York; as well as the references refened to 
above. 

The Ugands suitable for taigetiqg a particular sub-population of cells wiU be 
those which bmd to receptors present on oeUs of that sub-populatioa. Again, taking 
30 cytokines as an example, the target cells for t large mnnber of these molecules are 
already known, as noted above; and, in many cases, the particular cell surface 
recqrtors for the cytokme have already been klentified and characterized; see, e.g., the 
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publications referred to above. TypicaUy. the cell surface receptors for cytokines are 
transmembrane glycoproteins that consist of either a single chain polypeptide or 
multiple protem subunits. The receptors generally bind to their cognate ligands with 
high affinity and specificity, and may be widely distributed on a variety of somatic 
cells, or quite specific to given cell subsets. The presence of cytokine receptors on a 
given cell type can also be predicted from the ability of a cytokine to modulate the 
growth or other characteristics of the given cell; and can be determined, for exanq)le, 
by monitoring the binding of a labeled cytokine to such cells; and other techniques, as 
described in the references cited above. 

Thus, for example, a large number of cytokine receptors have been 
characterized and mai^ of these are known to belong to receptor families which share 
similar structural motifs; see, e.g., the review by Miyajnna, A., et al.. Ann Rev. 
ImmunoL 10:295*331 (1992), and the publications reviewed therein* hereby 
incorporated by reference. T>pe-I cytokine receptors (or hematopoietic growth factor 
receptors) inchide, for example, the receptors for IL-2, IL-3, IL4, IL-5, ILr6;*IL-7, 
GM-CSF, GCSF, EPO, CNTF and UF. Type-II cytokine receptors inchide, for 
example, the receptors for IFN-d, IFN-^ and IFN-y. Type-in cytokine receptors 
inchide* fm exanqile, the recqitors TNF-a, TNF-/S, FAS, CD40 and NGF. Type- 
IV ^tokine recq»tor8'(imnniiioglobulin-libe, or "Ig-like," receptors) mchide the*^ 
receptors for BL-l; and Ae receptors for IL^ and G-CSF (which have Ig-like motift in 
addition to die l>pe*I motif). These receptor families are described for exanq>le, in 
Smith et ai;. Science 248: 1019-1023, 1990); Larsen et al., J. Exp. Med., 172: 1559- 
1570, 1990); McMahan et al., EMBO J. 10:2821-2832, 1991); and in the reviews by 
Cosman et al.. Trends Biochem Sd 15: 265-269, 1990); and Miyajnna, A., et al., 
Ann. Rev. ImmunoL 10:295-331 (1992), and the publications reviewed dierein, all of 
which are hereby incorporated by refereiKe. As new q^kuKS are ^^^^^rmT^, these 
can be en^loyed in the present invention as loqg as they exhibit the desired binding 
characteristics and specifici^. The identification and charactarizaiion of cytokines, and 
the use of assays to test die ability of cytokines to activate particular target cells, are- 
known in die art; see, e.g., Ckmens, M.I. et al. (eds.) (1987) "Lynqihokiiies and 
Interferons,* IRL Press, Oxfind; and DeMaeyer, E., et al. (1988) "interferons and 
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Oiher Regulaioiy Cytokines/ John Wiley & Sons, New York; as well as the references 
referred to above. 

The choice of a particular ligand will depend on the presence of cognate 
receptors on the desired target cells. It may also depend on the corresponding absence 
5 of cognate receptors on other cells which it may be preferable to avoid targeting. With 
the cytokines, for exanq)le, the role of particular molecules in the regulation of various 
celhilar systems is well known in the art. In the hematopoietic system, for example, 
the hematopoietic colony-stimulating factors and interleukins regulate the production 
and function of mature blood-forming cells. Lymphocytes are dependent upon a 

10 number of cytokines for proliferadon. For example, cytotoxic T lymphocytes (CTLs) 
are dependent on helper T (Th) cell-derived cytokines, such as IL-2, for growth and 
proliferation in response to foreign antigens. (Zinkemagel and Doherty, Adv. 
Immunol. 27:51, 1979; Male et al.. Advanced Immunology, Chap. 7, Gower Publ., 
London, 1987; Jacobson et al., J. Immunol. 133:754, 1984). IL-2, for example, is a 

15 potent mitogen for CTLs (Gillis and Smith, Nature 268:154, 1977), and the 

combination of antigen and IL-2 cause proliferation of primary CIM*** T cells in vitro. 
The importance of IL-2 for the growth and maintenance of the CDS'** CTL in vivo has 
been documented in models of adoptive immunotfienqy in which the therapeutic 
efficacy of tiansfecied anti-ietroviral CDS'** ceDs is enhanrrd on siibsequat 

20 administration of IL-2 (Cheever et al., J. Exp. Med. 155:968, 1982; RedcMiase et al., 
J. ViroL 61:3ia2, 1987). IL4 and IL-7 are also capable of stimulating Ae 
pioliferation of matme CD%* CTL (Alderson et al., J. Exp. Med. 172:577, 199Q). In 
the case of IL-2, the IL-2 receptors are txpmstA on T-cells, B-cells, natural killer 
cells, glioma cells and cells of die monocyte lineage (Smith, Science 240:1169, 198^. 

25 However, the greatest level of high zfRsdty JLr2 receptor expression is observed in 

activated T-cells (Waldemann, Ann. Rev. Btocbem. 58:875, 1989). Ibe IL-2 recq)tor 
complex consists of three protein components, a low affinity receptor, a, Tac or p55 
(Leraard «t al.. Nature 311:626, 1984), an intermediate affinity receptor, 0 or p70 
(Hatakeyama, Science 244:551, 1989), and a signal transduction protein, y or p64, 

30 which interacts with the p70 receptor subunit (Takeshita et al.. Science 257:379, 1992). 
The combination of the a and ^ subunits together make vp a high affinity form of the 
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IL-2 receptor (Hatakeyama, Science 244:SS1, 1989); a-ff-y combinations appear to 
have the highest affinity (Asa et al., P.N.A.S. 90:4127-4131, 1993). 

Thus, for example, a GDFP including IL-2 can be used to target gene delivery 
specifically to activated T lyn4)hocyte5 which express high levels of 0-/7-7 affinity 
5 recepton. The cellular targets of a large number of the other cytokines are known and 
described in the reviews and other references cited above. Furthermore, following the 
approaches described in those references, any particular cell population or sub- 
population can be readily assayed for sensitivity to a given cytokine. 

The choice of a particular ligaixl may also be influenced by other activities that 

10 may be possessed by the ligand (besides binding to the cell surface). For example, 

GDFPs having B/T components derived from cytokines may possess cytokine effector 
activity that can be used to modulate the targeted cells in accordance widi the activity 
of the cytokine. Typically, GDFPs of this type will be prepared by incorporating die 
entire cytokine coding sequence into a polynucleotide encoding the GDFP; althougli it 

IS wiU also be possible to remove portions of the cytokine sequence ixli^ ^ >^ 

required for binding to the receptor iior essentid for ^tt>kiiie effector activi^ In such 3 
cases, the GDFPs can provide a combinatioa of activities ccnnprising: (0 bindiqg to - ^ 
specific target cells; (ii) delivery of targeted nucleic acid into the targ^ed cells; and < 
(iii) cytokine modulation of Ae cells thus targeted. Such a combination of activities •■^ •'■^ 

20 win allow* for exanqite, the transduction of particular celb to be coupled 

proliferation of the transduced cells. This will be generally advantageous-^in die context ^ 

of gene delivery since it can be used to promote the proliferation of the targeted cells 
in a given cell population; and will be particularly advantageous for in viyo gene 
delivery where it may be otterwise problematic or inqx}ssible to induce the targeted 

25 cells to divide, which may be necessary for effkieni. stable incorporation of the 
transferred gene. 

In some cases, it will be preferable to make use of the receptor binding potential 
of a ligand such as a cytokine wiAout concomitant effector activity. This may be tlie 
case, for example, wbeia a cytokiiK with suitabte reo^tor binding pro pe rti e s has 1 
30 negative or unwanMl effect on target cell activity. GDFPs of tfiis type can be 

prqMued, for nampfe, from cytokme sequences in wfakh tftt domain reqxmsible fior 
effector activity has be» mutationally altered by, e.g., substitution, inseitioa or 
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deletion. For example, IL-2 has been subjected to deletion analysis to identify which 
portions of the sequence are involved m receptor binding and which aie critical for 
cytokuie effector activity; see, e.g., Brandhuber, B.J. et al., J. BioL Chem. 
262:12306-308, 1987; Brandhuber, B.J. et al.. Science 238:1707-09, 1987; Zurawsld, 
5 S.M. et al., EMBO J. 7:1061-69, 1988; and Arai, K., et al., Anmi. Rev. Biochem. 
59:783-836, 1990. The receptor bmding and effector domains of a number of oAer 
cytokines have similarly been characterized; see Arai et al., id, and other reviews and 
references cited therein. 

The rapidity with which novel ligands and their cognate receptors have recently 

10 been molecularly cloned has generated a wide array of these molecules. In particular, 
the combination of direct cDNA expression cloning and screening assays for either 
induction of proliferation of binding to specific cell sur£ace ttceptors on taigei cells has 
led to mai^ new molecules being cloned (see, for exanq)le, Cosman et al.. Trends 
Biochem Sci IS: 265-269* 1990). The advent of these technologies will undoubted^ 

15 lead to the cloning of mott ligands, inchidiqg cytokmes and odier proteins that bmd to 
cells, which, on the basis of their bindiqg characteristics and specificily may be used in 
the context of the present invention as the B/T componeni of the GDFP. B/T 
components derived from the fDc-2/flt-3 ligaiid (Lyman et al.. Cell 75:1157-1167, 
1993) wiU be of mteiest because the cytxddne binds specificaify to a nccpux, fIk-2/flt- 

20 3, which is expressed on early hematopoietic cells (Matdiews, W. et al.. Cell 65:1143, 
1991; and Small et al.. P.N.A.S. 91:459-463, 1994). In the context of the present 
invention, GDFPs conqirising a B/T conqK>nent derived from the flk-2 ligand couU 
thus be used to direct gene delivery to lymphohematopoietk stem cells. 

While the foregoing principles have been illustrated using cytokines as a 

25 convenient exanq^le, these principles are also applicable to other ligands cdpMc of 
binding to cell surfaces, inchidmg for exanq>le, antibodies, lectins, viral bindmg 
proteins, cellular adhesion molecules, and apy other proteins that associate with ceUular 
surfices. 

Fdr exanq)fe, a large number of antibodies to cell surfiioe antigens have been 
30 identified and described. Antibodies to leuko^tes have been well duuacterized and 
classified as the "CD* series of antigens; see, e.g., Coligan, J. et al. (ed.), "Current 
Protocols in Immunology," Current Protocols, 1992, 1994. Moreover, ttrhniqiifts for 
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the isolation of new antibodies specific for a particular target, cell are routine in the art. 
Useful antibodies will be those which interna with antigens on the surface of the 
desired target cells. Antibody/antigen binding can be readily determined and monitored 
by flow cytometry or other inununochemical detection methods. 
S Of particular interest are antigens that are exclusively or preferentially expressed 

on the surface of particular target cells. For example, the CD34 antigen is expressed 
on human lymphohematopoietic stem cells (Andrews et al.. Blood 80:1693-1701, 
1992). 

Transferrin, (see, e.g., Zenke, M. etal., P.N.A.S. 87:3655-3659, 1990), can 
10 also be used as a B/T component in the context of the present invention. 

Targeting to certain cells, for example respiratory epithelial cells, can also take 
place via immunoglobulin (Ig) recepton (see, e.g., Ferkol, T., J. Qin. Invest. 
92:2394-2400(1993). 

The GDFPs of the present invention can also be chemically modified, for 
15 example by the addition of lactose to target die GDFP to asialoglycoprotein receptors 
and thus to hepatocytes of the liver (see, e.g., Neda, H. et al., J. Biochem. 266:14143- 
14146, 1991). 

Another group of proteins from which the B/T components can be derived are 
lectms. A number of such molecules, and their cognate receptors, have been identified 

20 and characterized (see, e.g., tl» review by Lis & Sharon, Aim. Rev. Biochem.* 55:35- 
67, 1986; and publications cited therein). 

Proteins ciqiable of targeting the GDD and dms the GDFP/tNA complex to cell 
surfaces can also be derived from viruses. Many such viral proteins capable of binding 
to cells have been identified, ii^hiding, for exuaplc, the well-known envelope ("env") 

25 proteins of retroviruses; hemagglutinin proteins of RNA viruses such as die influenza 
virus; spike proteuis of viruses such as the Semliki Forest vinis (Kfelian and 
Jungerwirdi (1990) Mol. Biol. Med. 7:17-31); and proteins £rom nothenveloped yiruses 
sudi as adenoviruses (see, e.g., Wickham et al.. Cell 73:309-319, 1993). 

As an iUustrative exanqile, in die murine leukemia virus (MuLV) system, it is 

30 well known that the amino-terminal region of dK gp76 molecule is involved in binding 
to cell smftoe receptors, see, e.g.. Heard and Danos, J. Vind. 65: 4026-4032, 1991. 
Batdni et al., J. Viiol. 66: 1468-1475 (1992) have also reported diat portions of die 
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amino-tenninai region of gp70 can be exchanged in oider to switch binding to different 
MuLV env receptors without interfering with the abili^ of the protein to interact with 
pl5E TM protem (and, thereby, to mediate viral uptake); see also Weiss, R. et al« in 
Weiss, R. et al. (eds.). RNA Tumor Viruses, Cold Spring Harbor, New York (1984 
5 and 1985). Similarly, in the hunuui inmiunodeficiency virus (HIV) system, mutational 
analysis of gpl20 has identified portions of the molecule which are critical for binding 
to the CD4 receptor, see, e.g., Kowalski, M. et al.. Science 237:13SM3S5, 1987. 
Yet another approach to identify a region critical for receptor binding is as follows: an 
antibody known to inhibit binding can be used to immuno-affinity purify a cleavage 

10 fragment of the viral binding protein; which fragment is then partially sequenced to 

identify the corresponding domain of the viral binding protein, see, e.g., Laskey, L.A. 
et al.. Cell 50:975-985, 1987. Such techniques can be employed in the presem 
invention to generate GDFPs in ^ch the M-D component remains cspMt of 
mediating uptake of the GDFP/tNA complex (as described below), but the specifidty of 

15 bmdmg is principally determined by die presence of, e.g., cognate ^tokine recqitors 
correspcmding to a portion of the B/T craoponent, rafter than viral binding protem 
recqptors* 

Another iUusirative example of a viral protein that can be used is the G protein 
of VSV, which has been utilized to target infection by retioviial vectors; see, e.g., Emi 
20 et al., J. Virol., 6S:12Q2*12Q7, 1991. 

Another group of proteins from which the B/T components can be derived are 
cellular adhesion molecules. A number of such molecules, and their cognate recqrtors, 
have been identified and characterized (see, e.g., Sprii^, T., Nature 346:425-434^ 
1990, and publications cited therem). 

25 

(2\ MembT ^pg-Pjspj ptiny (M>D^ Components 

Membrane-disrupting conqionents are protein seqmnces c^M^le of locally 
disnqrting celhilar membranes such that the GDFP/tNA complex can traverse a ceUular 
membrane. 

30. M-D conqponents faciUtatiiig iqytake of the GDFP-target^ 

by target cdls are ^kally menibrane-active regkms of protein structure haviqg a 
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hydrophobic character. Such regions are typical in membrane-active proteins involved 
in facilitating cellular entry of proteins or particles. 

For example^ viruses commonly enter cells by endocytosis and have evolved 
mechanisms for disrupting endosomal membraiKS. Many enveloped viruses encode 
5 surface proteins capable of disrupting cellular membranes including, for example, 

retroviruses, influenza virus, Sindbis virus, Semliki Forest virus. Vesicular Stomatitis 
Virus, Sendai virus. Vaccinia virus, and mouse hepatitis virus; see e.g., Kielian and 
Jungerwirth, Mol. Biol. Med., 7:17-31, 1990; and Marsh & Helenius, Adv. Virus 
Res., 36:107-151, 1989. The mechanism for viral entry, in which a viral binding 
10 protein binds to a specific cell surface receptor and subsequently mediates vinis entry, 
frequently by means of a hydrophobic membrane-disruptive domain, is a common 
theme among enveloped viruses, inchiding infhienza virus, and mai^ such molecules 
are knawn to those sidlled in the ait, see, e.g.. Hunter and Swanstrom, Curr. Top. 
Micro, and Immmiol. 157:187, 1990; and the review by White, J., Science 258:917- 
15 924, 1992; and publications reviewed therein. 

By way of iUustration, the M-D components of the present invention can thus be 
derived from a portion of a vual bindmg protein diat is normally involved in mediating 
uptake of the virus into a host cell, or a mutein of such a portion of a binding protein. 
The portion of the GDFP that may be derived from such a viral binding protein may, 
20 but need mt, also contain the portion of the binding protein that causes the viral 

particle to associate with a specific receptor on a target cell (which latter portion may 
thus function as a B/T component, as described above). A large number of viruses 
have been characterized and, for maiiy of these, the nucleotide sequence of the viral 
genome has been published. The binding proteins encoded by various viruses geheraUy 
25 share functional homology, even though there may be considerable variation among the 
primaiy amino acid sequences. Using the retroviruses to ilhistrate, the native SEBL gene 
product is typically a polyprotein precursor that is proteolyticaUy cleaved during 
transport to the cell surface to yieU two polypeptideis: a glycosylated polypqitide on the 
external surftce (the "SU" protein) and a membrane-sparmmg or traasmembrane 
30 protem (the 'IM' protein); see, e.g.. Hunter. E. and R. Swanstrom, Cmr. Topics 

Microbiol. Immunol. 157:187-253, 1990. The SU proteins are responsible for binding 
to spcci& reoquors on the surface of target cells as a first step in the infection 



WO^E/28494 



PCT/US95AM738 



-28- 

process. The TM proteins, as well as associating with viral core proteins through their 
C*terminal ends, are responsible for a critical membrane fusion event which takes place 
after binding and allows entry of the virus into the cell (See, e.g.. Hunter and 
Swanstrom (1990) Curr. Top. Micro, and Immunol. 157:187; Kielian and Jungerwirth 
5 (1990) Mol. Biol. Med. 7:17-31; and Marsh & Helenius (1989) Adv. Virus Res., 
36:107-151). The membrane fusion event is accomplished by a hydrophobic 
polypeptide sequence present at the amino terminus of the TM protein. Examples of 
these pairs of SU and TM proteins and the viruses which produce them are: gp52 and 
gp36 from nu)use mammary nmior virus (Racevslds, J. et al., J. Virol. 35:937-48, 

10 * 1980); gp85 and gp37 from Rous sarconm virus (Hunter et al., J. Virol. 46:920, 1983); 
gp70 and pl5E from Moloney murine leukemia vims (Koch et al., 49:828, 1984); gp70 
and gp20 from Mason Pfizer monkey virus (Bradac, J. et al.. Virology 150:491-502, 
1986); gpl20 and gp41 from human immunodeficiency virus (Kowalski, M. et al.. 
Science 237:1351-1355, 1987); and gp46 and gp21 from human T*Cell leukemia virus 

15 (Seiki el al., Proc. Natl. Acad. Sci. 80:3618, 1983); and others described m the 
references cited herein. The functional simibri^ among these types of proteins, is 
ftirther ilhistrated tiy the well-documented phenomenon of "pseudotyping," in which the 
core proteins and nucleic acul are provided by a first virus and the envelope proteins 
(detenmning host range) axe provided by a different virus (see, e.g., Vik el al., 

20 Virology 180:420, 1991; Miller et al, J. Virol. 65:2220, 1991; and Landau et al., J. 

Virol. 65:162, 1991). Examples of retroviruses which can be used to derive fragments 
for use in the present invention include murine retroviruses such as Harvey murine 
sarcoma virus (Ha-fMSV), Kirsten murine sarcoma virus (Ki°MSV), Moloney m^"^ 
sarcoma virus (Mo-MSV), various murine leukemia vinises (MuLV), mouse mammary 

25 tumor virus (MMTV), murine sarcoma vims (MSV) aixl rat sarcoma virtis Q(aSV); 

bovine leukemia virus (BLV); feline retroviruses sudi as feline leukemia virus (FeLV) 
and feline sarcoma virus (FeSV); primate retrovinises such as baboon endogenous virus 
(BaEV), human immunodeficien^ viruses (HIV-I and HIV-II), human T-cell leukemia 
viruses (HTLVrl and HTLV-IO, Gibbon dipt leukemia virus. Mason Pfizer monkey 

30* viius (M-PMV), simian immunodeficiency vinis (Sr^ 

(SSV); various lentivinises; and avian retroviruses such as avian eiytfaroblastosis vinis, 
avian leukosis vims (ALV), avian myeloblastosis virus, avian sarcoma vims (ASV), 
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avian reticuloendotheliosis-associated virus (REV-A), Fujinami sarcoma virus (FuSV), 
spleen necrosis vims (S^fV) and Rous sarcoma vims (RSV). Many other suitable 
retroviruses are known to those skilled in the art and a taxonomy of retrovimses is 
provided by Teich, pp. 1-16 in Weiss et al.. eds, RNA Tumor Vimses, 2d ed.. Vol.2, 

5 Cold Spring Harbor, New York. Plasmids containing retroviral genomes are also 
widely available from the ATCC and other sources. 

Infectious virions have also been produced when oSB-retroviral binding proteins^ 
such as the G protein of vesicular stomatitis virus or the hemagglutinin of influenza 
vims, have been pseudo-typed onto retrovirus cores (see, e.g., Emi et al., J. Virol. 

10 65:1207, 1991; and Dong et al., J. Virol. 66:7374, 1992). These latter examples 

indicate that there are functional commonalities between various vimses and their mode 
of entry into cells which will aUow the use of viral binding proteins from a varieQf of 
sources. Influenza hemagglutinin has also been reported to enhance the iqitake of poly- 
L-lysine-based chemical conjugates (Wagner et al.» P.N.A.S. 89:7934-7938, 1992). 

IS The sequences of a large number of viral binding proteins are known, and are 

generally available firom sequence databases such as GENBANK. Furthermore,* 
polynucleotides encoding viral binding proteins can be readily obtained from viral 
particles thonselves. Also, since many different genes encoding viral binding proteins 
have been cloned and characterized, plasmids containing DNA encoding the binding 

20 proteins are available from a number of different sources: Polynucleotides encoding 
viral biiKling proteins can also be obtained using polymerase chain reaction (PGR) 
mediodology, as described, for exan^le, by MuUis and Faloona (1987) Medi. 
Enzymology 155:333. 

As an illustrative embodiment of the present invention, a GDFP may con:Q>rise a 

25 region of a gene encoding a viral binding protein inchiding a B/T 

case the GDFP can be used to target cells inchiding those normally susceptible to the 
virus from winsk the gene was derived. In other anbodiments of die present 
invention, the targeting may be restricted to cells beariqg receptors for other types of 
Uganda, discussed above under the description of the B/T component For example, 

30 ' whoe an M-D component is derived from a viral binding protein diat retains the abiUty 
to bind to the viral receptor, but it is desirable to limit targeting to cells bearing, e.g., 
an s^)propriate cytokine receptor, there are several approaches that can be used to 
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achieve such specifici^. One approach is to utilize a B/T component which is based 
on a cytokine with a veiy high binding affmiQr for the desired target cells compared to 
the binding affuity of a domain derived from a viral binding protein for the native viral 
binding protein receptors. Since many of die cytokines are known to exhibit very high 
affinity binding to their receptors, and since it will be feasible, for example, to base the 
M-D component on a lower-binding-affinity viral binding protein, targeting can be 
effectively focused upon those cells bearing a cognate cytokine receptor. Another 
suitable approach to limiting binding is to derive the M-D con^nent from a mutant 
viral binding protein in which the mutation has disnqned the abiliQr of the protein to 
engage in binding via the native vkal binding protein receptor but has not interfered 
whfa the ability of the vhal binding protein to mediate viral intake, nasmids encoding 
sttdi mutant viral binding proteins are available intheart;anditwinalsobe wdl 
within die abili^ of one skilled in the ait to prqnre new versions of such viral binding 
protem mutants deleting portions of die codmg region bt by introducing amino acid 
substitutions mto the coding sequence as described above. 

While die foregoing principles have been iUustrated ushig viral proteins as a 
convenient example, diese principles are also appUcaUe to odier polypeptides capable 
of disrupting cellular membranes (sec, e.g., die review by White, J., Science 258:917- 
924, 1992, and publications reviewed therein). 

Odier domains that are functionally and/or strucoiraUy analogous can be derived 
from various viral, prokaiyotic or eukaiyotic sources. As a fiirdier specific exanpte, 
bacterial toxins such as diphdieria toxin have a specific domain widi a highly 
alpha-helical strucoue and a hydrcqdiobic diaracter (known as die "TM'' domain in die 
case of d^hdwria toxin) diat becomes protonated at low pH and disrupts cellular 
membranes, facilitating entry of die toxin mto die cells (see, e.g., Choe et al. (1992) 
Nanne 357:216-222; vandeiSpeck et al, J. Biol Chem 268: 12077-12082, 1993; and 
Parker & Pattus, Trends Biochem. Sd. 18:391-395. 1993). Toxins such as 
Pseudtmonas exotoxin A have a similar membraDe-disnqiting domain (see, e.g., Strom 
et al., Ann. N.Y. Acad. Sd. 636:233-^50, 1991). Similar M-D components can be 
derived from odier bacterial toxins such as hemdysin (Sutioip et al.. J. Exp. Med., 
178:337-341. 1993). As described berdn. indudon of such membrane disruptive 



W0 95/284M 



PCTAJS95y04738 



-31- 

components in the GDFP would facilitate membrane disniption and entry of the 
GDFP-tNA complex into the target cells. 

Cytolytic pore-forming proteins, such as streptolysin 0« perforins expressed by 
cytotoxic T lymphocytes, and S.aureus alpha toxin also have the abUity to disrupt 
5 membranes (see, e.g.. Ojcius and Young, Trends Biochem. Sci., 16:225-230, 1991; 
Suttorp et al., J. Exp. Med., 178:337-341, 1993). Streptolysin O has been shown to 
facilitate uptake of DNA by cultured cells when added to the culture medium (Barry et 
al., Biotechniques 15:1018-1020). There are mai^ bacterial cytolysins which have the 
capability to induce membrane disruption (see, e.g., Braun and Focareta., Crit. Rev. 

10 ' Microbiol. 18:115-158, 1991; and van der Goot et al.. Nature 354:408-411, 1991). 
Membrane disruption often occurs by means of a pH-induced hydrophobic change in 
the protein, but diis can also occur by en^matic means, such as those involving 
phospholipases (see, e.g, Braun and Focareta., CriL Rev. Microbiol. 18:115-158, 1991 
(and references cited therein); and London, Mol. Microbiol. 6:3277*3282, 1992). 

15 Where a pH shift is required to induce die xneinbrane disniption function, there are 
several ways in which this can be achieved. For example, the GDFP/tNA complex 
may be taken up through acid endosomes; or the pH of the extraceUular medhm may 
be transiently lowered to mediate activation of the membrane di5nq)tion function. In 
some cases (dqphtheria toxin for example), enzymatic nicking of the membrane active 

20 component prior to an induced pH change m the surrcRmding medium is believed to 

promote membrane disruption (see, e.g., Sandvig and Olsnes, J. Cell Biol. 87:828-832, 
1980; Moskaug et al., J: Biol. Chem. 263:2518-2525, 1988; and Zaiman and 
Wisnieski, Proc. Natl. Acad. Sci. 81:3341-3345, 1984). WeU-known enzymes such as 
trypsin and urokinase have been successfuUy used to provide the liicking ac^ 

25 vitro (see, e.g., Williams. D.P., et al., J. Biol. Chem. 265:20673-20677, 1990). 
Enzymes cqiabte of providing die iiickiiig.activiQr are also kno^ 
cellular surfaces (see, e.g., Williams, D.P., et al., kl.). Exenq>lary construction and 
characterization of GDFFs containing the dq>dieria toxin transmembrane region are 
described below in Example 8... 

30 Other sources of M-D components inchute bacterial proteimAatp 

of organisms into cdls, socfa as die 52kD entry protem of Mycobacterium tuberadasis 
(Arruda et al.. Science 261:1454-1457, 1993); the imemalm protein dfLisuria 
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monocytogenes (Portnoy et al.. Inf. Imm. (U.S.) 60:1263-1267); and the invasin 
protein of Yersinia enterocolitica (Young et al., J. Cell Bio., 116, 197-207), among 
others. 

Synthetic analogs of membrane-disrupting domains can also be made. See, e.g., 
5 Kaiser and Kezdy, Science 223:249-255, 1984. 

n\ TmnrnftTiA^lization rr/L) Components 

Transport/localization components mediate or augment Ac transport and/w 

localization of the GDFPANA conq>lex to a paiticttlar sub-ceUular conqwrtment such as 

10 the nucleus or mitochondrion. 

A number of sequences that mediate tian^KXt and/or localization of proteins 

have been identified. These inchuk, by way of iUustiation, the nuclear localization 

sequence (nls) of SV40 T antigen (CoUedge, et al., Mol. Cdl Bio. 6:4136-4139, 1986); 

and the HIV matrix protein (Bukrinsky et al., Nanne 365:666-669, 1993). These are 

15 typically short basic peptide sequences, and may also be bqfiartite basic sentences (see, 
e.g.. Garcia-Bustos et al., Biochim. Biophys. Acta 1071:83-101. 1991; and Robbins et 
al.. Cfcll 64:615-623, 1991). Nuclear localization sequences have been fused to 
heterologous proteins and shown to confer on them the properQr of nuclear localization 
(see, e.g.. Biocca et al., EMBO J. 9:1Q1-108. 199(9. In the case of die human 

20 estrogen receptors, for exanq)le, fusion proteins trafRc to the nucleus in an estrogen- 
dependent &shion (Ishibashi et al.. J. Biol. Chem. 269:7645-7650. 1994). Hiese 
fifqtwurfff can be readily incoipotated into the GDD by recombinant DNA mediodology 
to facilitate nielear localization of the desired GDFP/tNA con^lex. GAI4 has also 
been shown to possess nuclear localization properties in yeast (see, e.g.. Silver, et al.. 

25 P.N.A.S. 81:5951-5955, 1984), and tims, as a component of a ODFP, GAL4 may be 
used as both an NBD and a <H>D with a role in tranqKnt/locdUzatian. 

^4^ Replied In**yr»riftii nm Qmmonenti 

RepUcon intqptation conqmnents mediate ot augment integration of tiie targeted 
30 nucleicacidinmar^iiconoftlietaisetcdl, such as a chromosome. In tnaogr 
instances in gene transfer and gene ibenpy it is advaniageoas to obtain staUe 
integration of transferred DNA into die genome of tbe target oeU. TlieGDFPean 
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facilitate such integration. Also, as described herein, a particular advantage of the 
Type I GDFP approach is that it not only allows the stoichiometric attachment of 
delivery components to the tNA, but also allows the GDFP to be positioned at pre- 
detennined locations with respect to the tNA. In the case of a replicon integration 
5 component such as an integrase, the GDFP can be positioned in proximity to terminal 
integrase recognition sequences as a means of facilitating integration, as described in 
more detail below. 

DNA-protein interactions can mediate integration of DNA into the mammalian 
genome. For example, the integration of all known retroviruses takes place in an 

10 enzymatic reaction that makes an endonucleolytic cleavage of the host DNA and ligates 
die leverse-transcribed retroviral genome to the finee ends of die host cell DNA. This 
reaction is mediated by the retroviral integrase (or "IN") protein, and it is well known 
that the IN protein interacts widi a minimal number of bases present on the ends of the 
pre-integrative viral genome to achieve integration. Indeed, DNA sequences beariqg 

15 the IN sequence recognition motif can be inserted into free DNA in vitro by purified 
IN proteins (see, e.g., Bushman et al.. Science 249:1355-1558, 1990; and Katz et al., 
CeU 63:87-95, 1990; see also. Brown et al. Cell 49:347-356, 1987; and Roth et al.. 
Cell 58:47-54, 1989). For example, die MLV, HIV and RSV IN proteins are each 
kmiwn to mtexzct widi a distinct ^bon IN sequence recognition motif present at each 

20 end of the linear pre-integrative viral DNA substrate to mediate its mtegration into the 
host cell replicon. In vitro mtegration mediated by purified IN protein has been*, 
demonstrated usiiqg either free oligonucleotides or synthetic DNA substrates bearing the 
IN recognition sequence motif (see, e.g., Katz et al., siq>ra; and Bushman and Craigie, 
J. Vbxd. 64: 5648, 1990). Synthetic DNA substrates can be readily engineered by 

25 mserting a unique restridlcm en^me site (typicaUy Mid), flanked by Ae appropriate 
IN recognition seqiienDes, into a plasnud vector. Digestion of the vector wifli Ndd 
yiekls a DNA substiate witii 3* recessed ends preceded by the highly consnved 
5*CA-0H dinucleotide and the remamder of tiie appropriate IN recognition motif, 
which resembles the processed ends of dxf pre-integrative viral DNA widi which IN 

30 interacts to mediate integiatioiL Tlus approach has been suooessfulty used to 

demonstrate m vitio mtegration of such finearized DNA substrates mto double-stranded 
DNA in vitro by purified recombinant avian retrovirus IN (Katz et al., supra), 
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MoMLV IN (Bushman and Craigie. J. Virol., supra) and HIV IN (Bushman et al.. 
Science, supra). The IN recognition sequences used on the tennini of the substrate 
DNA in these experiments were short (10-30 base pairs), demonstratii^ that 
heterologous DNA substrates with short terminal IN sequence recognition motifs can be 

5 integrated into double-stranded DNA by IN. Moreover, these experiments document 
successful integration of both ends of the DNA substrate into the target DNA, as 
opposed to the oligonucleotide integration experiments which assay only for integration 
of a single end of the substrate DNA. These experiments document that linear DNA 
substrates beaiii^ diort terminal IN recogniticm motifs can be integrated into 

10 double-stranded DNA in vitro by purified IN protein. Tims, die foregoing experimnts 
provide further evidence of the utili^ of the piesett inventioii. in that IN c(^^ 
can be inchided in the GDFP and can act in concert widi terminal IN recognition 
setpience motife present on the (substrate) iSA to mediate efincient huegratian. 
Recombinant fuaons between imegiase and heterologous proteins have previous^ been 

15 constructed, expressed and shown to retain integcase enzymatic activiiy (see. e.g.. Vudc 
et al., J. Virol., 68:1468-1474. l»4). Moreover, Bushman (PNAS 91:9233. 1994) 
has shown that recombinant fusions can be nude between integrase aid a sequence- 
specific DNA bindnig protein, and diat such fusions retain integrase activiQr and 
sequence-specific DNA binding. 

20 - Thus, for example, by including an RI component derived from an integrase 

protein in the GDD, and using a tNA bearing the IN recognition sites, the GDFP can 
be co-imxoduced widi the targeted nucleic acid (tNA) bearing the integraticm 
recognition motif and dnis achieve integration of die tNA intt> a repUcon of the target 
cell. This system would also allow, in ooiguncdon witti an i^prcqwiaie bindiiig doo^ 

25 in die NBD. far the association of the Rlconqionentwidi die fitee ends of die tNA. 
This would be advantageous smce the IN proteins of v^virases fahctian to mediate 
integration at the fine ends of pre-integntive vital DNA. In dtt piesent bivendon, dus 
can be achieved by utilizing a TVpe-I GDFP in coiyunction widi a linearized tNA 
containing die cognate recQgiiition sequence ("CRS") for dK NBD at (or in ckMe 

30 proxnnity to) dK ends of die tNA beating die teiimnal IN sequence lecognitionniotif 
(preferably less dian 500 nucleotides from tbe IN sequence, mote prefinably less dnn 
200 nucleotides, most preferably less dian 50 nucleotides). To generaliB did tNA iEbr 
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gene delivery, the tNA can be consmicted, for example, with a unique Ndel site 
between the two IN recognition motifs, flanked by the cognate recognition sequence of 
the NBD. Digestion with Ndel would then generate a linear DNA molecule with 3* 
recessed ends preceded by the 5*CA-0H dinucleotide, the remainder of the IN 
5 sequence recognition motif, and the CRS for the NBD, in that order. In this way, the 
terminal IN recognition sequences would be closely linked to the cognate recognition 
sequence for the NBD. Typically, the Ndel site/IN recognition sequence/CRS cassette 
would be inserted into the piasmid backbone of the vector. However, it is possible to 
construct a tNA devoid of any extraneous or undesirable sequences; for example, a 
10 tNA devoid of ai^ bacterial piasmid sequence can be generated by flanking each end of 
the mammaKan expressiou cassctte in the tNA with a CRS, followed by one half of the 
IN recognition sequence, followed by an Ndel site. Digestion by Ndel would then 
generate a linear tNA DNA fragment, which could be readily purified from the piasmid 
backbone fragment, havmg on each end the IN recognition sequence and die CRS. 
IS Removal of piasmid backbone sequences may be desirable to achieve c^timal g^ 
regulation in the transduced cells. Binding of the GDFP would then locate the RI 
component containing the IN region in close proximity to the sites on the tNA with 
which it can n^diate ef&ient integration. An analogous strategy can be used with Itae 
AAV Rep protein and viral ITRs (see, e.g., Owens et al., J. Virol. 67:997-1005 
20 (1993), and the review by Carter, B.J., Current Opinion Biotech. 3:533-539 (1992) and 
publications reviewed therein). Additionally, odier recombinase systems such as die 
bacteriophage PI ere recombinase, the yeast FLP recombinase, the yeast SRl-derived 
R recombinase and the Tyl integrase (see, e.g., Kilby et al.. Trend Genet., 9:413-421, 
1993; Moore and Garfinkd, PNAS 91:1843-1847. 1994) can be used in the context of 
25 the present invention using fusions with appropriate NBDs, cis*acting recombinase 
recognition sequences and CRS elements, in an analogous fashion to that desc ri bed 
above for Oe integrase fusions. 

Multimerizing the RI domain may be required tor optimizing integration 
activity, especially in situations in which the protein 
30 derived functions in multimeric form. Thus, for example, many native retroviral IN 
proteins art dhneric or multimeric in form (see, e.g., Jones et al., J. Biol Chem 23: 
16037, 1992; levfewed in Skalka, Gene 135:175. 1993). Multimcrization of the IN 
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domain can be conveniently achieved by, for example: (a) constructing tandem repeats 
of the IN component in the GDFP, preferably separated by a flexon; (b) dimerizing the 
GDFP by insertion of a protein dimerization motif, e.g., a leucine zipper motif (see, 
for example, Hu et al., 1990, Science 250: 1400); (c) adding free IN protein to an IN* 
5 containing GDFP (since IN proteins have a naniral tendency to multimerize); or (d) 

multimerizing the CRS in the tNA such that multiple GDFP molecules bind to each end 
of the tNA. Combinations of the above strategies can also be used. This would result 
in further multimerization of the IN component and thus a more active integration 
complex. 

10 Another strategy to achieve multimerization of the RI domains, and also to 

achieve more efficient concerted integration of the tNA (i.e. having both ends of the 
tNA integrate into the replicon), would be to engineer the system to bring the free ends 
of the linear tNA together. This can be achieved in a number of ways in the context of 
ibt present invention. First, die GDFP monomers can be designed in such a way that 

IS they self-dimerize, using a leucine zippcx or other motif as described above. Thus, 

dimerization of the GDFP bound to the tNA would result in close qiposition of the free 
ends of the tNA, since the CRS is located near fliese tennini. A second qyproach 
would be to use a second, separate DNA bindiqg protein widi additional cognate 
recognition sequences piesenl near the tennini of the tNA to bripg die ends of the tNA 

20 together (for this purpose any of the DNA binding domains alluded to above could be 
used in dimeric form together with a tNA having the appropriate cognate recognition 
sequence to associate the free DNA ends). Such strategies would bring the free ends of 
the tNA in close q^position to one ?r^th^ and thus may furtiser enhance the frequency 
of concerted integration. Several approaches can be used to avoid any potential 

25 problem that may arise from GDFP/tNA conq>lex auto-integration (i.e. integration of 
the ends of the tNA molecule into itself), or cross-integration of one tNA molecule into 
anodier. Although con^lexii^ of the GDFP with the tNA could be done at 4"C, thus 
reducing IN enzymatic activity, placing Ae complex onto cells at higher temperatures 
could lead to such unwanted integration events. A pi efened qyproach is to use a 

30 . concfitional RI moie^. Such conditional RI moieties can be dependent on dnnical or 
protein co-factors, or can be mutants that are conditional for full activity dependent oa 
tenqperature or other variables, such as die presence or absence of inhihilon or co- 
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factors. For example, in the case of IN, temperature-sensitive (ts) IN mutants have 
been made that are active only at certain temperatures (see, for example, Schwartzberg 
et al.. Virology 192:673, 1993). The use of a RI component would allow exposure 
to and uptake of the complex by cells to be done at the non-pcnnissive temperature 
S (such that the RI component would not be active), followed by switching to the 

permissive temperature once the complex was taken up into the nucleus, allowing the 
RI component to be active m the context of the host cell replicon and dnis accomplish 
the desired integration. 

Thus, inclusion of an RI component in the GDFP can be used to enhance 

10 frequencies of integration. The GDD can consist of an RI component alone, or it can 
in addition comprise one or more of the other compoxmoss discussed above. Where the 
RI conqxmem is the sole component in the GDD, the NBD would function to associate 
the RI component more stably and/or more specifically with the free ends of die tN A 
than is possible through, for exanq>le, use of ihe recombinant native IN protein alone. 

IS By virtue of the NBD binding, the RI moie^ is more tightly associated widi the tNA 
termini during die transfection process and can mediate integration into the host cell 
replicon. Where the RI component is the sole component in the GDD, the tNA/GDFP 
complex can be delivered by any of the standard means of transfection, such as 
lipofection, electroporation, etc., and die resulting cells would have an enhanced 

20 frequency of stable gene delivery as a consequence of enhanced integration of the tNA. 
Alternatively, the complex can be delivered by other non-viral means, including for 
example die use of self-assemblmg systems such as viral cq>sid proteins. Experimental 
evidence has confinned that viral capsid proteins can be used to introduce DNA into 
mflmnifliiflti cells (sce, e.g., Forstova et al., Huni. Gene Ther., 6:297-306, 1995). 

23 In certain cases, such as the retroviral IN or AAV Vlep prdtems* components of 

the GDD can also function as effective NBD conqwiK^ 
function in the GDFP virtue of their abili^ to bind nucleic acid (see, e.g., 
Krongstad & Champoux, J. Virol. 64:2796*2801, 1990; Owens et al., J. Virol. 67:997- 
lOQS (1993); and die review by CaitCT, B J., Cuxxent 

30 (1992) and publications reviewed dnein). 
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1. The Targeted nucleic acid ON A) 

The targeted nucleic acid (tNA) is a polynucleotide, or analog thereof, to be 
delivered to a target cell. Thus, targeted nucleic acids include, for exanq)le, 
oligonucleotides and longer polymers of DNA, RNA or analogs thereof, in double- 
stranded or single-stranded form. The tNA may be either circular, supercoiled or 
linear. A preferred example of a targeted nucleic acid is a DNA e^^ression vector 
comprising a gene (or genes) of interest opmbly linked to a transcriptional control 
region (or regions) and a cognate recognition sequence capable of being bound by the 
NBD ^""mh of the GDFP. The transcriptional control region may be selected so as to 
be specifically activated in the desired target cells, or to be responsive to specific 
ceDular or odier stimuli. 

Targeted nucleic acids may also include, fot example, positive and/or n^ative 
selectable markers; thereby allowing the selectioa for and/or against cells stably 
expressing the selectable marioer, other in vitro or in vivo. 

Use of the present invention to deliver RNA would enable tbe introduction of 

RNA decoys (Sullenger et al., Cdl 63:601-608. 1990); ribozymes (Yamg et al., CeH 
67:1007-1019, 1991); and antisense mideic acids (Vk±en et al.. Nucleic Ackl Res., 
19:3359-3368), for example. 

In Type-I GDFPs. the targeted nudeic adds are recognized and bound by die 
GDFP by virtue of specific cognate recognition sequences to «4uch die nudek add 
binding domain (NBD) of the Type-I GDFP binds. Both DNA and RNA binding 
domains have been isolated from proteins that bind to particular nudek: acids in a 
sequence-specific fashion. Inchision of such a cognate recognition sequence in the 
targeted nucleic add aUows for specific bindmg of the GDFP to tbe tNA. Recognition 
sites for maqy nuddc ackl binding ^oieins have been identified (see. for example. 
MildieU A Tjian. 1989; and other lef eience s herein). 

Binding of sequence-qiecific binding proteins to DNA tends to be more avU 
when die rccbgnitkm sequence motif is mnltimerized (see. e.g.. HochscfaiU and 
Ptasfane. CeU 44:681-687, 1986). Accordingly, the cognate recognidon sequences toy 
be muhimerized in the targeted nuddc adda so as to enhance the binding affinily or 
selectivity of a GDFP for its cognate tNA. This could also have other advantages, 
such as increasii« the eftetive amoutt of tte GDFP boinid to tte tNA. or pro^^ 
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compaction/condensation of the tNA by sequence-specific or sequence-non-specific 
NBD compoKnts. 

Typically, but not necessarily, the cognate recognition sequences in expression 
vectors will be placed in the plasmid backbone of the vector. This also applies to other 

5 cis-acting sequences that are needed in the tNA to facilitate gene delivery. However, it 
may be desirable to remove plasmid backbone sequences from the DNA to be 
transferred. In this case, die expression cassette can be conveniently flanked by 
restriction enzyme sites, such that restriction enzyme digestion separates Ae backbone 
from the mammalian expression cassette, lie expression cassette can then be purified 

10 away from the plasmid backjbone for use in transduction experiments. Oeariy in this 
case the CRS would need to be located on the fragment bearii^ the eq)ression cassette. 
It is also possible, of course, to construct the GDFP so as to bind to more than one 
tNA. 

As discussed above, the tNA can also be bound to the GDFP via sequence-non- 

15 specific interactions in addition to sequence-specific interactions. In a Type-I GDFP, 
such sequence-non-specific mteractions can be mediated by auxiliary components 
derived from sequence-non-specifk binding proteins, as discussed above. Such 
auxiliary non-specific binding conqxments can also serve to compact or otherwise 
reconfigure the targeted nucleic acid; see, sapoL. 

20 The targeted nucleic acids can also include, for example, non-expressed DNA, 

such as sequences homologous to sequences present m a target cell replicon, that can 
thereby mediate homologous recombination. This can be used to facilitate the stable 
integration of the targeted nucleic acid, or a desired portion thereof, into a specific site 
in a replicon present in die target cell, such as a specific site m a ceUultt chromoscmie. 

25 this may be useful, ftxr example, to adiieve a desired level of expression of die tNA 
by integration at a deitired chromosomal site. Homologous recombination can also be 
used to alter a speaGc DNA sequence in a target cell rq>licpn (see, e.g., Thomas Sc 
Capecchi, CeU 51:503-512, 1987). 

For longer tNA sequences, or where the tNA uptake mechanism (wiiedier part 

30 ofthe GDFP or iK>t) is loKiwn or suspected to be sensitive to the size, form orchard 
of nucleic acids and/or complexes to be delivered, such as mechanisms mvohrii^ 
endocytosis, it may be desurable to condense and/or charge neutralize tbe tNA. Tlus 
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can be achieved by mixing the tNA with any of a number of proteins or other agents 
(collectively referred to as "compacting agents") that can condense and/or charge 
neutralize nucleic acids. Compacting agents include, for example, histones (see, e.g., 
von Holt, Bioassays 3:120-124, 1986; and Rhodes, Nucleic Acid Res., 6:1803-1816, 

5 1979); or polypeptides derived therefrom (Rodriguez et al., Biophys. Chem., 39:145- 
152, 1991); as well as the non-histone high mobility group proteins. Poly-L-lysine or 
other polybasic amino-acids can also be used as compacting agents (see, e.g., Li et al.. 
Biochemistry, 12:1763-1772, 1973; and Weiskopf and U, Biopolymers 16:669-684, 
1977). Similarly, other polycationic polymers such as polyamines, for example 

10 spermine and spermidine, and cationic lipid-containing polymers can also be used to 
condense and/or charge neutralize nucleic acid (see, e.g., Feuerstein et al., J. Cell. 
Biochem., 46:37-47, 1991; and Behr, Bioconj. Cbem., 5:382, 1994). Retroviral 
nucleocapsid proteins can fulfill a sunilar role (see, e.g., Gelfand et al. J. Biol. Cbem., 
268:18450-18456, 1993). 

15 Alternatively, compacting agents can be incorporated as an additional 

conqponent of the GDFP. Also, some sequence specific bindnig proteins, such as 
GAL4, which exlubit a range of bindiqg affinities to different cognate nucleic acid 
sequences may also be used in this capacity, and in Ais regard would function as an 
NBD with both nucleic acid binding and con^acdon properties. 

20 ConqMu:ting agents might also be incorporated as mediators of indirect bindiog 

between the tNA and the NBD domain of the GDFP (for exanq>le, the NBD domain 
can be bound to the contacting agent and the compacting agent bound to the tNA). 

AssemblvofGDFPs 

25 Preferably, the GDFP is prepared as a single polypeptide fusion protein 

generated by lecmnbmant DNA mediodology . To generate such a GDFP, sequences 
encodiiig the desired conqxments of the GDFP are assembled and fiagments ligated mto 
an esqnession vector. Sequences encoding the various components may be assembled 
from odier vectors encodipg die desired protein sequence, from PCR-generated 

30 fragments using cellular or Wral nucleic acid as template nucleic add, or liy assenibly 
of qrnthedcotigonudeotides encoding die deshnd sequence. However, all nucleic acid 
sequences encoding such a piefiBired GDFP should prefenbly be assembled by inrfinme 
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fusions of coding sequences. Flexons, described above, can be included between 
various components and domains in order to enhance the ability of the individual 
components to adopt configurations relatively independently of each other. 

Although a Type-I GDFP is pxeferably assembled and expressed as a single 
5 polypeptide chain, one or more of its domains or components may be produced as a 
separate chain that is subsequently linked to the GDFP by, e.g., disulfide bonds, or 
chemical conjugation. It is also feasible to prepare complexes in which domains such 
as the NBD and the ODD or their components are physically associated by other than 
recombinant means, either directly or indirectly, for exan^le, by virtue of non-covalent 

10 interactions, or via co-localization on a proteinaceous or lipid sur&ce. 

The GDFP may be expressed either in vitro, or in a prolcaiyotic or eukaiyotic 
host cell, and can be purified to the extent necessary. An alternative to the expression 
of GDFPs in a host cell is synthesis in vitro. This may be advantageous in 
circumstances in.ii^iich high levels of expression of a GDFP might interfere with die 

15 host cell's metabolism; and can be accomplished using ai^ of a variety of cell-ftee 
transcription/translation systems that are known in the art. GDFPs can also be 
prepared synthetically. It will likely be desirable for the GDFP to possess a con^nent 
or sequence that can facilitate the detection and/or purification of the GDFP. Such a 
conqx)nent may be Oe same as or different firom one of the various components 

20 described above. 

Many approaches of expressing and purifyii^ recombinant proteins are known 
to those skilled in die art, and kits for recombinant protein expression and purification 
are available from several commercial manufacturers of molecular biology products. 
Typically, an increased level of puri^ of the GDFP will be desirable. However, 

25 because of die specificiQr of the GDFP fbr nucleic acid bindiqg, die degree of 

purificati<m need not nec^saiily be extensive. The GDFPs of the present inventioo 

... may be fitmi^H |^ simple filtration through a 0.22 or 0.45u filter so as to avoid 

m jrwAfflf rjrwifnmifiafifln nf thg target ogflK. 

Since the domains of the GDFP can be assembled in modidar fashion in an 
30 expression vector, hs construction by recombinant DNA mediodology allows the GDD 
to consist of one or many con^Kments. Such components may have complementing 
activity in mediating or enhancing gene deliveiy, or diey may have closely related 
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fiinctioiis. In essence, the gene delivery domain can be viewed as possessing any 
function that mediates or enhances the efficiency of delivery of the tNA bound to the 
GDFP. 

Other Variations of GDFPs 

Other variations will be apparent to those of skill in the art. For example, the 
GDFP may itself be multimerized. Multimerization may be advantageous to increase 
avidity of binding of either the NBD or the ODD. A given tNA molecule may also 
coittain nmltqile distinct cognate recognition sequences, binding different Type I 
GDFPs with distinct fimctions, or the tNA may be bound with a mixture of Type I and 
l^pe n GDFPs. Additionally, certain components of the GDD, such as IN proteins, 
may require dimerization for optimal acthri^. Dimerization of the GDFP may be 
obtained by including, fiw example, a leucine zipper motif hi the GDFP. Such motifi 
ate commmi in DNA binding protems and ate responsible for their dhnerization 
(Kouzarides & Ziff, 1989). Leucine zqipers can be inserted into DNA binding proteins 
and cause fbem to dhnerize (Sellers and StniU. Nature 341:74-76. 1989). 
Multimerization of GDFPs can also be achieved, for example, by creating a 
recombinant fiision proteui that contains two or mote GDFPs. Preferably such 
multinwrized GDFPs are separated 1^ flexons, as described herein. Oflier 
oligomerization motifc ftom dimeric or multimeric proteins can similarly be employed. 

Uliiirtrfltioiis o f Tvncn Genft Dellvenr l^ision FrotdM 

Xype-n GDFPs do not bind targeted nucleic acids in a sequence-specific manner 
because the nucleic add binding components of Type-E GDFPs are aU derived from 
mideic acid bindii« proteins that are non-sequence-specific in their buiding to nucleic 
acid. 

f3»rM^ Acid ^iaMnc nmnaiw of Tvpe-H GDFPi 

The nucleic acid binding domains (NBDs) of lype-II GDFPl comprise binding 
components that ate derived 6om non-sequence-specifk: nucleic add biu 
recombinantly iiised to a gene delivety domain (GDD) as described above. 
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A number of non-sequence-specific nucleic acid binding proteins have been 
identified and characterized, including, for example, histones or polypeptides derived 
therefrom (see, e.g., von Holt, Bioassays 3:120-124, 1986; Rhodes, Nucleic Acid 
Res., 6:1805-1816, 1979; and Rodriguez et al., Biophys. Chem., 39:145-152, 1991); 
5 retroviral nucleocapsid proteins (see, e.g., Gelfand et al. J. Biol. Chem., 268:18450- 
18456, 1993); proteins such as nucleolin (Erard et al., Eur.*^ J. Biochem. 191:19-26, 
1990); avidin (Pardridge & Boado, F.E.B.S. Lett. 288:30-32, 1991); and polybasic 
polypeptide sequences such as poly-L-Iysine (Li et al.. Biochemistry, 12:1763-1772 
1973; Weiskopf and U Biopolymeis 16:669-684, 1977). 

10 For the reasons discussed herein, all of the GDFPs of the present invention are 

preferably produced as recombinant fusion proteins. However, the recombinant 
expression, in a host cell, of nonrsequence-specific nucleic acid binding con^onents in 
Type-n GDFPs (as well as in Type-I GDFPs that incoiporate sequeDce*non-speci&. 
nucleic acid binding components) may be hindered by mtfeiference of the expressed 

15 protems with host cell nucleic acids. In such situations, die GDFPs can be readily 

synthesized m vitro using any of a variety of cell-free transcription/translation systems 
thai are known in the art 

Gene Delivery Domains of Tvpe-H GDFPs 

20 The various possible sources of con^)onent8 making up the^gene delivery 

domains of Type-II GDFPs are essentially die same as described above for Typc-I 
GDFPs (although, by definition, Type-n GDFPs would not inchide sequence-specific 
binding components such as the sequenoe-specific integrase components described 
above for T>pe-I GDFPs)* 

25 . 

Targeted Wucteic Acids for Use with Tvne^n GDFPs 

The targeted nudeic acids to be combined widi Type-n GDFPs axe as described 
above except that they need not contain q^ecific lecQgnitibn sequences since the Type-II 
GDFPs bind nucleic acidb via non-specific in ter acti o ns. 



30 
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Assemblv of Tvne-n GDFPs 

The assembly of Type-II GDFPs is preferably via the synthesis of recombinant 
fiision proteins (see the description above regarding assembly of Type-I GDFPs). 

Using GDFPs «7f ^^^^ P^^ent Invention 

Thus, the GDFPs of the present invention can be used for in vitro or in vivo 
gene delivery. For therapeutic applications, target cells can be transduced ex vivo and 
letumed to a patient, or, given the biochemical nanue of the tNA/GDFP complex, cells 
can be treated directly in vivo. For such in vivo therapy, the con4>lexes can be 
formulated fbr a varied of modes of administration, inchiding systemic and topical or 
w^K^ administraticm. Techniques and fotmalati<ms may be found, for example, in 
Reminytnn's Pb ii^ac«iriail Sciences. Mack Publishing Co.. Easton. PA. (latest 
editioii). The tNA/GDFP conq)lex may be combnwd wittt a carrier such as a dihieat or 
excq>ieiit wtaOi may include, for example, filters, extenders, wetting agents, 
^i^nrtyaitf*. surface-active agents, or hdnicants. depending on (he nattre of the mode 
of administration and die dosage ftnms. The nanne of the mode of administration will 
depend, for exampte, on the location of die desired target ceUs. For in vWo 
administration, injection is preferred, inchiding ininmuscular, intranimoral. 
intravenous, intni-arterial (inchiding deUveiy by use of doiibte balloon calfaBteis), 
intraperitoneal, and subcutaneous. Delivery to hmg tissue can be accomplished by, 
e.g., aerosolization. For injectimi, the complexes of die mvemion are formulated in 
liquid sohidons, preferably in physiologically compatible buffers such as Hank's 
sohiticm or Ringer's solution. In addition, die conq>lexes may be formulated in solid 
form and redissolved <» suspended immediately prior to use. Lyophilized forms are 
also included. Systeinic adinmistration can also be by transinucosal <»; transdermal 
mean, or the compounds can be administered oralty. For ttansmucosal or transdermal 
administratipn, penetrants apptppriate to die barrier to be permeated are used in die 
formulation. Ftor topical administraikm. the complexes of die mvention may be 
formulated mto ointments, salves, gels or creams, as is general^ known in die art 

The GDFP qiproach can dnis be used to target any cell, in vitro, ex vivo or in 
vivo, die only requirement being diat the target cells have binding siitt for die GDFP 
on their surfooe. The present invention will thus be tisefol for many gene therapy 
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applications. As an illustrative example, the target cells that could be used in the 
context of the present invention include lymphohematopoietic cells. These include: (i) 
stem C5lls, which have many potential applications in gene therapy, including 
correction of hereditary disorders such as Gaucher disease and hemoglobinopathies, as 
5 well as genetic modification with intracellular vaccines against HTV such as decoys or 
dominant-negative proteins; and (ii) lymphocytes, which would allow genetic 
modification of effector T cells such as CTLs for use in human tiierapy with genes of 
interest such as suicide genes and regulated promoter cassettes. Also inchided for use 
in the context of the present invention are cells of Uie cardiovascular system TR^uch line 
10 blood vessels inchiding endothelial cells and vascular smooth muscle cells, which could 
be genetically modified to inhibit atherosclerosis or restenosis foUowing angioplasQr. 
Similarly, die piesert invention could be used to introduce genes into airway epithelial 
cells, such as tiie CFTR gene to correct cystic fibrosis. The present invention could 
also be used to transduce tumor cells and thereby genetically modify them to express 
15 suicide genes for tumor elimination or produce cytokines or express immunostimulatoiy 
molecules for use as a tumor vaccine in cancer patients. Another illustrative 
application of the present invention is delivery of DNA or RNA to antigen presenting 
cells (APCs). This could be useful, for example, to aUow expression of specific (tNA- 
encodecQ antigeu by an AFC, fbetdby allowing the AFC to stimulate an antigen> 
20 specific immune tespoEBe. such as a CTL response. Such an approach can be used in 
vitro, by transduction of APCs with a GDFP/tNA complex thereby aUowing antigen 
presentation for the stnnulation and generation of CTLs in vitro, or m vivo delivery 
can be used, to allow such antigen presentation in vivo. I^rect deliveiy of RNA to 
J^Cs ii«w«E die inesent invention may be espedalfy desirable for situations in which 
23 antigens are encoded transcrqtts Oat require special conditions for inttacettular 
transport or processing that may not happen efficient^ in dw AFC. An iUustrative 
example would be rev-dependem RNAs of HIV (such as HIV gag). Traiwhictioa of 
APCs with RNA in the context of die present invention can thus be used, fior exanq>le, 
to circumvent the need for niKlear ejqwrt of rev-dependent RNAs. Additionally, the 
30 present invention could be used to introduce genes into hepatocytes of die liver to 
correct genetic defects such as fiamilial hypercholesterolemia, hemophilia and odier 
metabolic disoideis. « to produce recombhiant products for qrstemic delivery. 
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Similarly, fibroblasts or connective tissue cells could be modified to secrete cytokines 
or soluble enzymes for immunomodulatory purposes or to correct a metabolic 
deficiency. These tissue targets and diseases, together with others are more fiilly 
described in Scriver et al.« Eds.« 'The Metabolic Basis of Inherited Disease', 6th Ed., 

5 McGraw-Hill, 1989, and in Miller, A.D., Blood 76:271-278, 1990. The present 

invention is particularly useful in cases in which genes of interest cannot be transferred 
by commonly used viral vectors, or in which the target cells are not infectable by viral 
approaches (see, e.g., Israel & Kaufman, Blood, 75:1074-1080, 1990; Shimotohno & 
Temin, Nature 299,263-268, 1982; Stead et al.. Blood, 71:742-747, 1988; and Bodine 

0 ' et al.. Blood, 82:1975-1980, 1993). 

The GDFP approach of the present invention can be used as a generically useful 
method for gene transduction of cells, and could be provided as a laboratoiy kit for 
gene transduction for use with, e.g., insect, avian, mammalian, or other higher 
eukaryotic cells. 

15 The transfer of genes in the present inventioii can also be facilitated by other 

biochemicals known to enhance Oe intake of nucleic acid by cells (see, e.g., Kawai ft 
Nishizawa, Mol. CeD. Biol. 4:1172-1174, 1984; Behr et al., P.N.A.S. 86:6982-6986, 
1989; Rose et al., P.N.A.S. Biotechniques 10:520*525, 1991; Pardridge ft Boado, 
F.E.B.S. Lett 288:30-32, 1991; Legendie ft Szoka, P.N.A.S. 90:893-897, 1993; 

20 Haensler ft Szoka, Bioconj. Cbem. 4:372-379, 1993). These and other techniques for 
use in the context of the present invention can be used under conditions (for incubation 
etc.) as described in the art (see, e.g., Kriegler, M. 1990 (ed.), **C3ene Transfer and 
Expression, a Laboratory Maraial," (1990)). In the case of GDFPs comprising pH- 
depcDdent M-D components, such as the TM protein of diphtheria toxin (see, e.g., 

25 Cboe et al. (1992) Nature 357:216-222), entiy of the GDFP/tNA complex into the cell 
can be conveniently achieved by singly reducing the pH of the incubation mednm 
duriqg transductioQ. 

The examples presented below are provided as a fimher guide to die practitioner 
of oidinaxy skin indieart,andarenottobe construed as Umit^g die invention in any 
30 wi^. 
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Example i 

Pre paration of a Nucleic Acid Binding Domain (NBD> 
From the Yeast GAL4 Protein 
The DNA binding domain of GAL4» amino acids 1-147 (Laughon and 
5 Gesteland, Molecular and Cellular Biology 4:260*267, 1984; Ma and Ptashne, Cell 

48:847-853, 1987; and Carey et al. , J. Mol. Biol. 209:423-432, 1989), was amplified 
by PCR from cerevisiae (ATCC 60248) using the following anqplimers. 

The amplimer for the 5* end of GAL4 was as follows: 
5* GCGC ACTAGT GCCACC ATG AAG CTA CTG TCT TCT ATC G 3\ 
10 The GAL4 codii^ region is underlined* This amplimer created a Spel site 

(ACTAGT) for cloning into pBhiescr^t (Stratagene) which allowed for subsequent 
transcripdon by T3 RNA polymerase. The zmplkocr also tnchided a consensus 
sequence (GCC ACQ for efficient protein translation located upstream of the initiator 
methionine (Kozak et al., NucL Acids Res. 13:3374, 1987). 
IS The an^limer for the 3* end of die GAL4 Nffi-terminus {up to amino acid 147) 

was as follows: 

5- GCGC GGTACC TCCGGA TAG AGT CAA C TG TCT TTG ACC 3 \ - 

The GAL4 coding region is underlined. This amplimer created a 3* Asp718 site 

(GGTACQ for cloning into pBluescrij^ as m>ted above. The an^limer also inchided a 
20 - BspEl site (TCCGGA) to allow for an in-frame fusion with an oligomer encoding a 

flexible peptide sequence (see below). 

The GAL4 fragment was amplified by 30 cycles of PCR directly from a colony 

of S. cerevisiae . The product was digested with Spel and Asp718 fmd ligated between 

the Spel and AspllS sites located in die polylinker icgion of pBhiescr^ Tbe 
25 construct was transfonned into the DHIOB stram of fi^ £qU by electroporation, and a 

colony containing die GAL4 fragmeiit was identffied by restriction eniyme analysis. 

The resulting plasmid, designated pT3gGAL4, is shown in Fig. 2A. 
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Preparation of a Gene Delivgrv nnm^\f^ 
From the Human IL-2 Protein 
A DNA fragment encoding mature soluble human IL-2 (amino acids 21-133) 
5 was amplified by PGR from a full-length human IL-2 cDNA (Taniguchi et ah. Nature 
302:305-310, 1983), using the following amplimers. 

The amplimer for the S' end of mamre human IL-2 was as follows: 
5' GCGC ACTAGT GCCACC ATG GCG CCT ACT TCA AGT TCT ACA AAG 

10 The IL-2 coding region is underlined. This amplimer created a Spel site 

(ACTAGT) for cloning into pBluescript, and inserted an initiator metfaionhie 
immediately upstream of amino acid 21 of IL-2. The amplimer also contained a 
consensus sequence (GCCACC) for efiRciem translation upstream of the inserted 
methionine, as noted above. A Narl site (GGCGCQ was also inchided which allowed 

15 for a subsequent in-frame fusion with a linker sequence which sq)arated the GAL4 and 
IL-2 domains in the GAL4/IL-2 cmstnict (see below). 

The amplimer for the 3* end of mature human IL-2 was as follows: 
5 • GCGC GGTACC TCA AGT CAG AGT ACT GAT GAT GCT TTG ACA AAA 
GGTAATC3\ 

20 This ampUma created an Asp718 site (GGTACC) for cloning mto pBluescript, 

and also retained the wild-type termination codon for human IL-2. A Seal site 
(AGTACT) at die 3' end of the IL-2 coding region was also created by this amqplimer 
wither introducing amino acid changes. The DNA fragment encoding the mature 
human ILr2 protein was amplified by 30 cycles of PCR from die fuU-lengdi human 

25 IL-2 cDNA referred to above. The product was digested widi Spel and Asp718, 
ligated mto pBhiescrqpt, and tnmsformed into DHIOB cells as ^d A 
colony harboring an appropriate construct was identified by restriction en^me analysis. 

Se^ienciqg of a plasmid derived from one colony revealed that an alteration 
Ooss of a single base resulting in a frame-shift near die terminus of lL-2) had occunred 

30 within die 3' aniplimer duriqg PCR cloniqg - diereby generating an IL-2 mutein. 
Specifically, die first T afior the Sea 1 site (in the first GAT trqilet) 
causuig a frame-shift diat also genentted a premature teimmation codon. As a result. 
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the S amino acids normally present at the temiinus were replaced by 3 different amino 
acids. This plasmid, referred to as "pTSmatlL-lm** (shown generically in Figure 2A as 
pT3matIL-2), was used to create a gene delivery fusion protein as described in 
Example 3. Despite the variation in the IL-2 domain, a GDFP based on this IL-2 
5 mutein exhibited IL-2 bioactivity, as described below. 

A second colony contained a plasmid designated "pT3matIL-2" (as shown 
generically in Figure 2A) that contained the expected wild-^pe IL-2 sequence, 
Plasmid pT3matIL-2 was used to create two GDFPs as described in Examples 3 and 4. 



10 * Example? 

Construction of Plasmids Encoding a Gene Delivery 
f^im Pn?teiq fqPFP) Saving a GDP an<| m ffflp 

' ^epm^ l?Y a FlCTon 
A DNA fragment encoding the nucleic acid binding domain (NBD) derived from 

15 GAL4 was isolated from pT3gGAL4 (Exanq)le 1) by digesting with Spel and BspEL ^ 
A DNA fragment encoding the gene delivery domain (GDD) derived from a human 
IL-2 mutein was isolated from pT3inatIL-2m (Exanq>le 2) by digesting with Narl and 
Asp718. The following oligomer pair encoding the flexon sequence 
(GlyGlyGlyGlySer), was annealed creating a 5' BspEl over-hang (CCGGA) and a 3* 

20 Narl over-hang (CGCQ: 

f CCGGA GGC GGT GGA TCC GGT GGT GGA GGC AGT GGA GGA GGT GGC ^ ^ 
TCGG3*; 

5' CGC CGA GCC ACC TCC TCC ACT GCC TCC ACC ACC GGA TCC ACC 
GCCT3\ 

25 The NBD and GDD fragments and the annealed oligomer were Ugated into 

pBhiescript between die Spel and A^TIB 8ites» and transformed into DHIOB celb is 
described above. A coloqy harborii^ a construct that contained aU ttiree fragments 
identified by its ability to hybridize to both GAL4 and 0^-2 pF)-labeled fra^nmits, and 
by restriction enzyme analysis, 
30 In the resulting plasmid, designated "pT3GAL4/IL-2m" (shown generically in 

Fig. 2A as pT3GAL4/IL-2), die sequence encoding the GDFP was inserted into 
pBhiescript m an orientation which allowed for sense RNA transcrqrts to be synthesized 



W09S/2M94 



PCT/US95/04738 



•50- 

with T3 RNA polymerase. The resulting RNA, when translated, incorporated both the 
DNA binding domain of the yeast GAL4 protein and the mature form of the human 
IL-2 niutein, in that order, separated by a flexible amino acid linker. 

A second plasmid, designated pT3GAL4/IL-2 (as shown in Fig.2A). was 
5 constructed exactly as described for pT3GAL4/IL-2m, except that the DNA fragment 
eiKoding the gene delivery domain (ODD) derived from human IL*2 was isolated from 
pT3matIL-2 (Example 2). 

Example 4 

10 Construction of a Third Plasmid Encoding a Gene Delivery 

Fusion Protein fGDFP^ Having a GDP and an NBD 

Sq)Wate(^ W » Ft^W 
Another e^qiression vector encodixig a GDFP derived from IL-2 and GAL4 was 
constructed as foUows. 

15 The DNA binding domain of GAL4t amino acids 1-147 (Carey, et al., supra), 

was amplified by 30 cycles of PCR from pT3gGAL4 usmg the following amplimers. 

The anq)limer for the S* end of GAL4 was as follows: 
5' GCGC GGATCC ATG AAG CTA CTG TCT TCT ATC G 3\ 

This amplhner created a BamHl site (GGATCC) immediately upstream of Met' 
20 to allow for an in-frame fusion widi a flexible peptide sequence in front of GAL4 (see 
below). 

The an^limcr for tiie 3' end of ^ GAI4 NII2-temiinu8 (iqp to amino acid 
147) was as follows: 

5 • GCGC GGTACC G CTA GCT TAC AGT C AA CTG TCT TTG ACC 3 • . 
25 This ampHmer created an AspllS site (GGTACC) for cloning into pBhiescript 

and also iiichided an engineered termination codon (CTA) at Ae C4^ 

DNA bindiiig domain of GAL4. 

To construct ppiL-2/GAL4« the GAI4 PCR product was digested 

andAsp718. A DNA fhigmem encodii^ human II/*2 was isolated fhmi pTSmaO^ 
30 (see Example 2) by digestuqg with Spel and Seal. The following oligomer pair 

encoding the amino acid sequence (GlyGlyGlyGlySer), was annealed, creatiiig a 5' 

Seal over-hang (ACT) and a 3' BamHl over*hang (GATCQ: 
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S' ACT CTG ACT GGA GOT GGG GGC TCT GGT GGC GGA GGT AGT GGA 
GGA GGT G 3*; 

5' GA TCC ACC TCC TCC ACT ACC TCC GCC ACC AGA GCC CCC ACC TCC 
AGT CAG AGT 3*. 

S The IL'2 and GAL4 fragments and oligomers were ligated into pBluescript 

between the Spel and Asp718 sites and the construct was transformed into the DHIOB 
strain of £^ £Qii by electroporation. A colony containing all three fragments was 
identified by its ability to hybridize to both GAL4 and IL-2 ["P]-labeled fragments, 
and by restriction enzyme analysis. 

10 In the pT3IL-2/GAL4 constnict, shown m Figure 2B, the GDFP was mseited 

into pBhiescript in an orientation which allowed for a sense RNA to be synthesized 
with T3 RNA polymerase. The resulting RNA, when translated, incorporated both the 
mature form of human IL-2, and the DNA binding doinain of the yeast GAL4 protein, 
in that order, separated by a flexible amino acid linker. 

15 ^ 

Expression of Gene Delivery Fusion Proteins 
Sense mRNA encoding the GAL4/IL-2m GDFP construct (described in Example 
3) was transcribed m vitro wiA T3 RNA polymerase firom the pT3GAL4/IL-2 vectw. 

20 Briefly, pT3GAL4/IL-2m plasmid was linearized witii Asp718 and this template was 
combined with a ribonucleotide mixture (rNTPs), RNA cap structure analog 
(mTGppp). and T3 RNA polymerase in a HEPES-based buffer (Promega "RiboMAX"). 
Afker incubatkm at 37 degrees C, the DNA template was digested with RNase-fiee 
DNase (Promega), and the syntiKsized mRNA was separated from unincoiporated 

2S rNTPs by diromatograpfay through a G2S Sephadex^in cohmm (Boehrii^ 
Mamdieim), precipitated witt^ 

Tlie resultant mRNA was translated in a cen-6ee rd^it re^ 
system. imRNA was added to a translation mixiuie of reticulocyte lysate» RNasin, and 
conq>lete amino acids ( P r om ega). Translation was allowed to proceed for 1 to 2 hr. at 

30 30 degrees C, after which lysates were stored at -70 degrees C. The integriQr and 
molecular weight of the fusion protein was assessed by UKhiding [^S)-labeled 
medbibiune (Amenham) in tiie translation mix, and visualizing tiie produa by 
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polyacrylamide gel electrophoresis under denaturing conditions. Fig. 3, lane 2, shows 
die [^^S]-labeled GAL4/IL-2m translation product as resolved on a 14% acrylaniide gel. 
The position of the GAL4/IL*2m GDFP translation product agreed with the predicted 
MW of 33kD. Molecular weight markers are shown in Fig. 3, lane 1, and a negative 
control is shown in lane 3. 

Sense mRNAs encoding the GAL4/IL-2 GDFP construct (described in Example 
3) and the IL-2/GAL4 GDFP constnict (described in Example 4) were transcribed jn 
vitro with T3 RNA polymerase from the pT3GAL4/IL-2 and pT3IL-2/GAL4 vectors, 
respectively, exactly as described above. Figure 6, lanes 3 and 4, show the 
["S]-labcled IL-2/GAL4 and GAL4/IL-2 translation products as resolved on a 4-20% 
gradient acrylamide gd. The positions of the E2/GAL4 and GAL4/IL-2 GDFP 
translation products agreed with the predicted MWs of 33.3kD and 33.2kD, 
respectively. Molecular weight nuuters art shown m Fig. 6, lane 1, and a hiciferase 
control is shown in lane 2. 

Scquence-Speeific DNA Bindmg Activity of GDFPs 
The ability of the GDFP& of Exan^le S to ei^ge in sequence-specific DNA 
binding was demonstrated by use of an electrophoretic mobiliv shift assay (EMS A) 
(Ausubel et al. (eds), "Current Protocols in MolecuUv Biology/ (1987 and 1993)). 
The target oligomer to which ibc GAL4 protein binds was: 
5* TCGACCWAGTACn'GTCCTCCGC 3* 
3' GCCTCATGACAC5GAGGCGAGCT 5\ 
The following target oligomer is not bound by GAL4 and was used as a 
negative control: 

5* TCGACTGAGTACTGTCCTGAGC 3* 
3' GACTCATGACACKJAGTCGAGCT S\ 
The GAL4 target oligomer was end-labeled using [^]-dCTP (Amersham) and 
Klenow polymerase (New England BioLabs). The htbeled oligomer was squrafed 
from unincoiporatBd nucleotides by chromatograidqr over a G25 spm cohmm 
(Bodiriiiger Mannheim)* and quantified by scintiliation oountiqg. This oligomer was 
added to reactions containiiig t HEPES-based buffer, which inchided 
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poly(dI-dC)«poly(dI-dC) (Phannacia), ZnClj, glycerol and BSA (Carey, et al.. J. Mol. 
Biol. 209:423-432, 1989; Chasman, et a!., Mol. and Cell. Biol. 9:4746-4749, 1989), 
and varying amounts of reticulocyte lysate containing either GAL4/IL-2m GDFP or 
IL-2. The reactions were clectrophoresed on a 4.5% polyacrylaniide/1% glycerol gel 

5 in 0.5x TBE. The gel was fixed in a methanol/acetic acid sohition, dried, and analyzed 
on a Molecular Dynamics phosphorimager . The results shown in Figure 4 show 
decreasing amounts of input GAL4/IL-2m fusion protein (lanes 1*3) showing specific 
interaction of the GAL4/IL-2m GDFP with the labeled target oligomer. DNA size 
markers appear in lane 4. In lane 5, the GAL4/IL-2m GDFP was incubated with 

10 labeled target oligomer, as in lane 1, but excess unlabeled target oligomer was also 

included and con^eted with the labeled targM oligomer for binding to die GAL4/IL-2m 
GDFP. In lane 6, the GAL4/ILr2m GDFP was incubated with labeled target oligomer, 
as in lane 1, but excess unhibeled iwn-binding oligomer was included, and showed lack 
of competition for binding of the GAL4/IL-2m GDFP to the labeled target oligomer. 

15 In lane 7, labeled target oligomer was incubated wifli lysate containing fauBum ILr2, 
and stowed no specific binding of the labeled target oligomer by either IL-2 or 
reticulocyte lysate components. The GAL4/IL-2m GDFP thus bound specifically to the 
cognate taxget sequence recognized by the GAL4 (NBD) domain of the G AL4/IL-2m 
GDFP. 

20 The lesuhs in Figure 7 show sequence-specific binding of the GAL4 protein and 

the IL-2/GAL4 and GAL4/ILr2 GDFPs. Taxget oligomers, binding conditions,* 
electrophoresis and gel treatment were exactly as described above, except that analysis 
was by autoradiogiapliy. The first four lanes contained decreasing amounts of input 
GAL4 protehit as indicated, showiqg specific interaction of GAL4 with the labeled 

25 target oligomer* The foUowmg lanes contained deonasing amounts of either inpitt 
GAM/11^2 GDFP or input n^2/GAI>l GDFP as indicated in The 
^^ff^gfiiitimi "+c* indicates Oat Oe GDFP was incubated widi labeled target oligomer, 
as in pitvipus tones, but excess unlabeled target oligomer vas also inchided in the 
leaction. The designatxm'4*m' indicates that the CDFP was incubated wtt^ 

30 taxget oligomer, as in previous lanes, but excess unlabeled ixinrbinding oligomer wis 
inchided in the reaction. The unlabeled taxget oligomer competed with the labeled 
taxget oligomer for bindqg to the GDFP vMt the non-bmding oligomer showed lack 
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of competition, demonstrating specific binding of the GDFP to the GAL4 recognition 
sequence. In the lane designated "IL2/ labeled target oligomer was incubated with 
lysate containing human IL-2, and showed no specific binding of the labeled target 
oligomer by either IL-2 or reticulocyte lysate components. The IL-2/GAL4 and 
GAL4/IL-2 GDFPs thus bound specifically to the cognate target sequence recognized 
by the GALA (NBD) domain of the GDFPs. 

I^Yfnlrine Bioactivitv of GDFPS 

GAlAflLrTm, GAL4/IL-2, and TLrl/GAlA fiiaon protems from m vj&a 
tianslatioiB (as in Example 5) were assayed for their ILr2 activities using the 
wen-known CTLLbioass^. Cells were incubited with the GDFP. then pulsed with 
3H-thymidine and incorporation of radioacthriiy into DNA was used as a measure of 
ceUttlar proliferation, as described by Gillis et al.. J. Immnnol. 120:2027, 1978. 

The results from the GAl>t/IL-2m GDFP are shown in Figure 5. The IL-2 
standard represents 1 ng/ml recombinant human IL-2 which was serially dUuted 1:3 ni 
the bioassay. The GAI>J/IL-2m GDFP curve was generated using in vitro translated 
material starting widi a 1:10 dihition of lysate. The bioassay shows retention of IL-2 
biological activity by die GAM/IL-2m GDFP. 

The results from the GAM/IL-2 and JLrVGAdA GDFPs are shown in Figure 8. 
The lLr2 standard was as described above. The GDFP curves were generated using ja 
vitro translated material staring wilh a 1:50 dihition of lysate. The bioassay shows 
retention of IL-2 biological acliviiy by the GAlAflLrl and IL-2/GAL4 GDFPs. 

r^wKtmerion a" 1 Qm^mr**^ GPFPff f^fnwinifMr ttie DmhAeria Toidn 

Trammembrane Region. 
The transmembrane (TM) domun of the D^hlheria toxin protein from £L 
diphtheriae . amino acids 205-378 (Choe. ct al.. Nature 357:216-222. 1992). is the 
region responsible for endosomal rdease of the catalytfc doihain of the toxhi into the 
cytoplasm of infected cells (Papini. et al., JBC 268:15^7-1574, 1993. Madsbns. JBC 
269:17723-17729, 1994). This region has also been shown to be capable of celtalar 
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membrane insertion in response to a mildly acidic environmeni (Moskaug, et al. JBC 
263:2518-2525, 1988, McGill, et al., EMBO 8:2843-2848, 1989). To incorporate this 
domain into the pre-existing GDFPs, two DNA fragments encoding the transmembrane 
domain of the Diphtheria toxin protein were amplified by PCR from C. diphtheriae 
genomic DNA, The first fragment, termed "DT," encoded amino acids 205-378 and 
was amplified using the following amplimers. 

The amplimer for the 5* end was as follows: 
5' GTAGATCTGGTGGAGGTGGCTCCGGAGGAGGT GGATCC fiAI ISfi fiAI 
GTC ATA AGO GAT AA 3' 

Hie amplimer for the 3* end was as follows: 
5' CTTC AGATCT GGATCC T CCA CCG CCA CTA CCT CCA CCC CCG QQA 
CGA TTA TAC G AA TTA TGA AC 3 ' 

The toxin TM sequences are underlined. Both amplimexs provided BamHl sites 
near the termini for subsequeitt cloning of the PCR fragment into BamHl-digest^ 
pT3IL-2/GAI>» (Example 4). 

The second fragment, tem^ "DAB," encoded amino acids 176-378, and 
provided additional residues at the amino tenninus of the transmembrane region. 
Within the context of the intact toxin protein d)ese additional sequences are involved in 
an enzymatic cleavage step whkh may be necessary for membrane fusogenic activity 
(Williams, et al., JBC 265:20673-20677, 1990, Ariansen, tx al.. Biochemistry 
32:83*90, 1993). The "DAB" fragment was amplified using die following amplimers: 

The ftTn piiwigr for the 5* end was as follows: 
5' GCG GGATCC GGT GGC GGA GGA AGT GAT GCG AZQ lAC SAfi IM 
ATGGCT C3^ 

The ftwi piimgr for the 3* end was the same 3' amplhner uscd to PCR the above 
described *DT' fragment 

The toxin TM sequences are underlined. The resulting "DAB" PCR fragment 
was digested with BamHl, and cloned mto die BadHl sites of bodi pT3GAL4/IL-2 
and pT3n/-2/GAL4 (Bcamples 3 and 4). 

Thus, three trq>leHlomain fusion plasmids were generated (pT3IL-2DTGAL4, 
pT3IL-2DABGAL4, and pT3GAI>IDABIL-2), each containing a version of the 
diphtheria toxin transmembrane region as the niiddle domain. Each of die three GDFP 
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RNAs was translated and the resulting fusion proteins were assayed for retention of 
IL-2 bioactivity and specific nucleic acid binding ability (Examples 5, 6, and 7). All 
three triple-domain GDFPs were found to retain these activities. 

Example 9 

Construction of a Targeted Nucleic Acid (tNA) 
The yeast transcriptional activator, GALA, has specific affmity for several 
closely related 17bp double-stranded DNA sequences, and it has also been shown to 
bind consensus synthetic 17bp target sequences with a similar affinity as it does 
wild-type sequences (Giniger, et al.. Cell 40:767-774, 1985; Bram and Romberg, 
P.N.A.S. 82:43-47, 1985). Target vectm were made by ligating oligomers containing 
a consensus 17bp sequence (Webster, ec al.. Cell 32:169-178, 1988; Carey, et al., J. 
MoL BioL 209:423-432, 1989) into die unique Sail site in the backbone of 
pDC3Q2CAT (Nfosley et al.. Cell 59:335-348. 1989; and Overdl et al., J. Imm. Mcth. 
141:53-62, 1991), a plasmid which directs die e^qiression of chloranqphenicol acetyl 
transferase (CAT) in manmialian host cells. The foUowing oligomers were used: 
5' TCGACGGAGTACTGTCCTCCGC 3' 

3' GCCTCATGACAGGAGC3CGAGCT 5\ 
The oligomer pair harbored an internal Seal site (TCATGA) which was used to 
screen resulting transformants for presence of die oligomer. In addition, there wm 
Sail-compatible over-haiigs at bodi the 5* and 3' ends, only one of which could 
regenerate die Sail site upon ligation. This feature was incorporated to allow die 
release of oligomer multimers for plasmid characterization. The oligomer pair was 
annealed by boiling equal amounts of each in a moderate salt buffer, then slow cooling 
the reaction. Tlie annealed oligomers were dien kinased with T4 polynucleotide kinase 
(Boehringer Mannheim) and ligaled to pDC3Q2CAT which had been linearized widi 
Sail. The constnicts were electroponded into DHIOBE,.^ and re^ 
were screened for dbe preseiKe of die target oligomer, or muhiniers diereof, by 
hybridizadon to a ra-labeled target oUgomer probe. Colonies harboring die target 
oligomer were fiirdier characterized for nuniber of copies by restricdon enzyme 
analysis and sequenciog. 
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Example 10 

Ability of GDFPs to Bind t o IL>2 Receptor-Bearing CTLL 
GAL4/IL-2 and IL-2/GAL4 GDFPs from in vitro translations (Example S) were 
funher demonstrated to bind CTLL (see Example 7) via the following assay. CTLL 

5 were incubated in IL-2-free medium for 2 hours or longer. ["S]-labeled GAL4/IL-2, 
IL-2/GAL4, IL-2. and GAL4 (Example 5) were incubated with the CTLL for 1 hour at 
4 degrees C in a binding medium containing 25mg/ml BSA and 2mg/ml Na-azide in 
RPMI-1640 buffered with 20mM HEPES. The binding medhmi was adjusted to a final 
pH of 7.2 prior to use. After binding, the cells were washed three times m ice cold 

10 PBS, and the final cell pellet was resuspended in a Tris buffer containing 15QniM 

NaQ, SmM EDTA, 0.02% Na-azide, and 0.5% Triton X*100 to gently lyse the cells. 
The lysate was spun briefly, and the supernatant was electrophoxtsed through a 4-20% 
gradient polyaciylamide gd. Figure 9 shows labeled protein present in die CTLL 
lysate and, therefore, associated with the CTLL. Lane 1 diows a molecular weight 

15 standard. Lane 2 shows the human IL*2 protein as present in the imreacted reticulocyte 
lysate (Example S), and lane 3 is the CTLL lysate after binding to IL-2. Lane 4 shows 
the GAL4/IL*2 GDFP as present in the unreacted reticulocyte lysate, and lane 5 is the 
(TTLL lysate after binding to the GAL4/IL-2 GDFP. Lane 6 shows the IL-2/GAL4 
GDFP as present in the unreacted reticulocyte lysate, and lane 7 is the CTLL lysate 

20 after binding to the JLrTJGALA GDFP. Lane 8 shows GAL4 as present in tbt 

MPrpi^ctf^ reticulocyte lysate, and bme 9 is the CTLL lysate after binding to GAL4. 
The GAL4/IL-2 and JLrl/GAlA GDFPs and IL-2 thus bind specifically to CTLL while 
GAL4 does not. This demonstrates that CTLL-specific bindmg of die GDFPs is 
niediated die IL-2 domain and not by the GAM >tomain. 




Ability of GAL4m^2 and IL-2/GAL4 GDFPs to Medmte Bindmg ofa Target 
Oligomer to IL-2 Receptor-Bearing CTLL 
GALMLrl and JLrl/GAlA fusion proteins from a ]diia translations (as in 
30 Exampte 5) were bound to pP]*dCIP««nd-labeled GAL4 tu 

in Example 6. Ite GDFP-tNA complex was bound to CTLL, as described in Exampte 
10, for 1 hour at 4 degrees C in binding medhm contamiqg 25mg/i]il BSA and 2Qig/ml 
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Na-azide in RPMI-1640 buffered with 20mM HEPES. The buiding medium was 
adjusted to a final pH of 7.2 prior to use. The cell-bound GDFP-tNA complex was 
separated from free GDFP-tNA by centrifugation of the binding mixture through a 
phthalate oil layer (Dower, et al., J. Exp. Med. 162:501-515. 1985). Cell-associated 
5 counts were quantified by scintillation counting. Figure 10 shows counts of labeled 

oligomer associated with CTLL as mediated by the GAL4/IL-2 GDFP, the IL-2/GAL4 
GDFP, GAL4, and a mgative control reticulocyte lysate designated "Bg.* The binding 
assay demonstrates the ability of both GAL4/IL-2 and IL*2/GAI>4 GDFPs to mediate 
binding of the oligomer tNA to CTLL. 

10 

Example 12 

Ability of the GA LA/TLr2 GDFP to Mediate Binding of a Target Plasmid to 1L^2 

Receptor-Bearing CTLL 
The GAL4/ni-2 GDFP (Exan^te 5) was bound tb die target plasmid usin^ 

IS bindiqg conditions described m Exanq>le 6. Tlie plasmid contained eigfal copies of the 
GAL4 17-iner target oligomer, as described in Exanq)k 8. The GDFP-tNA conq>lex 
was bound to CTLL for 1 hour at 4 degrees C in binding medium as described in 
Exan^le 10. CTLL wm Hbea washed three times in ice cold FBS, and die final cell 
pellet was resuspended hi a Tris buffer containing 15QmM NaCl, 5mM EDTA, 0.02% 

20 Na-azide, and 0.5% Triton X-100 to gently lyse the cells. The cell fysate was spun 
briefly, the supernatant was brought to 0.4N NaOH, and the san^)le was denatured at 
60 degrees C for 1 hour. The san^le was then applied via slot-blot onto GeneScreen 
Fhis membrane (NEN). The blot was screened for the presence of d^ target plasmid 
by hybridizadon to a [^]-labeled CAT probe. The membrane was washed, and the 

25 signal from cell associated plasmid was quantified by a phosphorimager (Molecular 
Dynamics). Figure 11 shows association of plasmid to CTLL me d i a t ed by either the 
GAL4/IL-2 GDFP or a ttgadve control reticulocyte lysate designated "Bg." The 
binding assay showed the ability of the GAL4/IL*2 GDFP to mediate bindiqg of 
plasmid tNA to CTLL. 

30 

iTtiifty The gene delivery fusion proteins of die present invention are useful in 
creating non-viral gene delivery systems for delivering a potynuckotide to i target cell. 
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Claims 

1. A fusion protein useful in delivering a targeted nucleic acid to a target cell» 
5 comprising a gene delivery fusion protein (GDFP), said GDFP comprising a nucleic 
acid binding domain (NBD) that binds to the targeted nucleic acid, fused to a gene 
delivery domain (GDD) that mediates delivery of the targeted nucleic acid to the target 
cell. 

10 ' 2. A fusion protein according to claim 1, wherein the targeted nucleic acid is a 

double-stranded nucleic acid. 

3. A fusion protein according to claim 1, wherein the targeted nucleic acid is a 
single-stranded nucleic acid. 

15 . • 

4. A fusion protein according to claim 1, wherein the targeted nucleic acid is 
DNA or an analog tbcteof. 

5. A fusion protein according to claim 1, wherein the targeted nucleic acid is 
20 RNA or an analog thereof . 

6. A fusion protein accordmg to daim 1, wherein die targeted nucleic acid is in 
the form of a recombinant expression vector comprising a nucleotide sequence to be 
es^ressed in the target cell. 

25 

7. A fusion proteuiaccordhig to claim 6, wherem die nucleotide se^ 
expressed is a nucleotide sequence Aat is not normally expressed in the targ^ 

8. A fiision protein accordmg to churn 6, wherein die nucleotide sequence to be 
30 expressed is an andsehte copy of a nucleotide sequence present in die target oeU. 
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9. A fusion protein according to claim 1, wherein said GDFP further comprises 
a flexible polypeptide linker sequence ("flexon") between said nucleic acid binding 
domain and said gene delivery domain or within one of said domains. 

10. A fiision protein according to claim 1, wherein said NBD comprises a 
nucleic acid binding component of a sequence-specific nucleic acid binding protein. 

11. A fusion protein accoiding to clahn 1, wherein said NBD con^rises a 
nucleic acid binding component of a sequence-non-specific nucleic acid binding protein. 

12. A fusion protein according to claim 1, wherein said NBD comprises a 
nndtiplicity of nucleic acid bindiiig (NB) components that bind one or more taigeled 
nucleic acids. 



15 13. A fusion protein according to claim 12, wherein said NBD comprises at 

least two NB components having differing binding specificities. 

14. A fusion protein according to claim 12, wherein the NBD comprises a first 
NB compovtat cq)able of binding to a specific cognate recognition sequence present in 

20 the targeted nucleic acid and a second NB conqx>nent capable of bindiqg non- 
specifically to the targeted nucleic acid. 

15. A fusion protein accordmg to claim 1, ixrtierein die NBD ftartfaer comprises 
a conqxment capable of mediating condensation and/or charge neutralization of die 

25 targeted nucleic add. . 

16. A fusion protein according to daim 1, wherein said gene delivery domain 
(GDD) comprises one or more conqxments that facilitate delivery of a targeted nucleic 
acid to a target cdl. 



30 



17. A fusion protein according to claim 16, wherem said components diat 
facilitate delivery of a targeted nucleic acid to a target cell are setoded from the group 
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consisting of a binding/targeting component, a membrane-disntpting component, a 
transport/localization component and a replicon integration component. 

18. A fusion protein according to claim 16, wherein said GDD comprises two 
5 or more components that facilitate delivery of a targeted nucleic acid to a target cell, 

said components selected from the group consisting of a binding/targeting component, a 
membrane-disniptmg component, a transport/localization component and a replicon 
mtegration component. 

10 19. A fusion protein according to claim 16, wherein said GDD comprises a 

binding/targeting catapoaoA. 

20. A fusion protein according to claim 16, wherein said GDD comprises a 
membrane disrupting conqxment. 

15 

21. A fusion protein according to claim 16, wherein said GDD comprises a 
transport/localization component. 

22. A fusion protein accordixQ to claim 16, wherein said GDD comprises a 
20 replicon integration component 

23. A fiision protein according to claim 22, wherein said replkon int^graticxi 
component is an int^grase enzyme or a deriivative thereof that retains mtegrase activity. 

25 24. A macromolecular complex useful m deliveripg a targeted nucleic add to a 

taiget cell, conprisuig a gene delivery fusion protem (GDFF) of daim 1 in association 
with a tatigeted nucleic add. 

25. A macromolecular con^tex according to claim 24, ixAierein said GDFP 
30 comprises a rq>licon integration component. 
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26. A macromolecular complex according to claim 25, wherein said replicbn 
integration component comprises a recombinase enzyme or a derivative thereof that 
retains recombinase activity, and wherein the targeted nucleic acid comprises NBD 
cognate recognition sequences in proximity to terminal recombinase recognition 

5 sequences. 

27. A macromolecular complex according to claim 26, wherein said 
recombinase is an integrase enzyme or a derivative thereof that retains integrase 
activity, and wherein the targeted nucleic acid comprises NBD cognate recognition 

10 sequences in proximity to terminal integrase recognition sequences. 

28. A recombinant polynucleotide useful for preparing a gene delivery fiisicm 
protein, said polynucleotide comprising a coding sequence that encodes a GDFP of 
claim 1. 

15 

29. The recombinant polynucleotide of chum 28, wherein said polynucleotide is 
in the form of an e3q>ression vector comprisiqg a transcriptional control region operably 
linked to said coding sequence. 

20 30. A cell useful in preparing a gene delivery fusion protein, said ceU 

containing an expression vector of claim 29. 

31. A method of usii% a recombinam polyniK:leotide of claini 29 to produce 
GDFP, said method coiiq)ri5ing the steps of causing the recombinant polynucleotide to 

25 be transcribed and translated, and recovering a GDFP. 

32. A method of using a GDFP of claim 1 to deliver said targeted nucleic acid 
to a target ceU, Ae method conq>risiqg the steps of contacting the GDFP witfi the 
targeted nucleic add to produce a GDFP/inicleic acid conq>lex and contactiiig said 

30 GDFP/nudeic acid complex widi the taigel cell. 



33. A ceU produced by the mediod of claim 32 and progeiqf di^^ 
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34. The cell of claim 33 wherein the targeted nucleic acid is expressed in the 
cell as an RNA molecule selected from the group consisting of an RNA transcript, an 
antisense RNA, an RNA decoy and a ribozyme. 

5 3S. A method of using a GDFP of clann 23 to deliver said targeted nucleic 

acid to a target cell, the method comprising the steps of contacting the GDFP with the 
targeted nucleic acid to produce a GDFP/nucleic acid complex and contacting said 
GDFP/micleic acid complex with the target cell. 



10 



36. A cell produced by the method of clahn 35 and progeny thereof, said cell 
con^rising an integrated copy of said targeted nucleic acid. 
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